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activity and for screening compounds that may be used in the treatment of GENSET- related disorders. 
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HUMAN CDNAS AND PROTEINS AND USES THEREOF 
DESCRIPTION 

5 RELATED APPLICATIONS 

The present application claims priority from U.S. Provisional Application Serial No. 60/224,009, 
filed August 7, 2000; U.S. Provisional Application Serial No. 60/293,574, filed May 25, 2001; 
U.S. Provisional Application Serial No. 60/298,698, filed June 15, 2001; U.S. Provisional 
Application Serial No. 60/302,277, filed June 29, 2001; and U.S. Provisional Application Serial 
10 No. 60/305,456, filed July 13, 2001, the disclosures of which are incorporated herein by reference 
in their entireties. 

FIELD Off TH E INVENTION 

The present invention is directed to GENSET polypeptides, fragments thereof, and the 
regulatory regions located in the 5'- and 3'-ends of the genes encoding the polypeptides. The 

1 5 invention also concerns polypeptides encoded by GENSET polynucleotides and fragments thereof. 
The present invention also relates to recombinant vectors including the polynucleotides of the 
present invention, particularly recombinant vectors comprising a GENSET gene regulatory region 
or a sequence encoding a GENSET polypeptide, and to host cells containing the polynucleotides of 
the invention, as well as to methods of making such vectors and host cells. The present invention 

20 further relates to the use of these recombinant vectors and host cells in the production of the 
polypeptides of the invention. The invention further relates to antibodies that specifically bind to 
the polypeptides of the invention and to methods for producing such antibodies and fragments 
thereof. The invention also provides for methods of detecting the presence of the polynucleotides 
and polypeptides of the present invention in a sample, methods of diagnosis and screening of 

25 abnormal GENSET polypeptide expression and/or biological activity, methods of screening 
compounds for their ability to modulate the activity or expression of the GENSET polypeptides, 
and uses of such compounds. 

BACKGROUND OF THE INVENTION 
cDNAs encoding secreted proteins or fragments thereof represent a particularly valuable 

30 source of therapeutic agents. Thus, there is a need for the identification and characterization of 
secreted proteins and the nucleic acids encoding them. 

In addition to being therapeutically useful themselves, secretory proteins include short 
peptides, called signal peptides, at their amino termini which direct their secretion. These signal 
peptides are encoded by the signal sequences located at the 5' ends of the coding sequences of 

35 genes encoding secreted proteins. Because these signal peptides will direct the extracellular 
secretion of any protein to which they are operably linked, the signal sequences may be exploited 
to direct the efficient secretion of any protein by operably linking the signal sequences to a gene 
encoding the protein for which secretion is desired, hi addition, fragments of the signal peptides 
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called membrane-translocating sequences may also be used to direct the intracellular import of a 
peptide or protein of interest. This may prove beneficial in gene therapy strategies in which it is 
desired to deliver a particular gene product to cells other than the cells in which it is produced. 
Signal sequences encoding signal peptides also find application in simplifying protein purification 
5 techniques. In such applications, the extracellular secretion of the desired protein greatly facilitates 
purification by reducing the number of undesired proteins from which the desired protein must be 
selected. Thus, there exists a need to identify and characterize the 5' fragments of the genes for 
secretory proteins which encode signal peptides. 

Sequences coding for secreted proteins may also find application as therapeutics or 

10 diagnostics. In particular, such sequences may be used to determine whether an individual is likely 
to express a detectable phenotype, such as a disease, as a consequence of a mutation in the coding 
sequence for a secreted protein. In instances where the individual is at risk of suffering from a 
disease or other undesirable phenotype as a result of a mutation in such a coding sequence, the 
undesirable phenotype may be corrected by introducing a normal coding sequence using gene 

1 5 therapy. Alternatively, if the undesirable phenotype results from overexpression of the protein 
encoded by the coding sequence, expression of the protein may be reduced using antisense or triple 
helix based strategies. 

The secreted human polypeptides encoded by the coding sequences may also be used as 
therapeutics by administering them directly to an individual having a condition, such as a disease, 

20 resulting from a mutation in the sequence encoding the polypeptide. In such an instance, the 
condition can be cured or ameliorated by administering the polypeptide to the individual. 

In addition, the secreted human polypeptides or fragments thereof may be used to generate 
antibodies useful in determining the tissue type or species of origin of a biological sample. The 
antibodies may also be used to determine the cellular localization of the secreted human 

25 polypeptides or the cellular localization of polypeptides which have been fused to the human 
polypeptides. In addition, the antibodies may also be used in immunoaffinity chromatography 
techniques to isolate, purify, or enrich the human polypeptide or a target polypeptide which has 
been fused to the human polypeptide. 

SUMMARY OF THE INVENTION 

30 The present invention provides a purified or isolated polynucleotide comprising, consisting 

of, or consisting essentially of a nucleotide sequence selected from the group consisting of: (a) the 
sequences of the odd SEQ ID NOs:l-l 1 1; (b) the sequences of clone inserts of the deposited clone 
pool; (c) the coding sequences of the odd SEQ ID NOs:l-l 1 1; (d) the coding sequences of the 
clone inserts of the deposited clone pool; (e) the sequences encoding one of the polypeptides of the 

35 even SEQ ID NOs:2-l 12; (f) the sequences encoding one of the polypeptides encoded by the clone 
inserts of the deposited clone pool; (g) the genomic sequences coding for the GENSET 
polypeptides; (h) the 5 ! transcriptional regulatory regions of GENSET genes; (i) the 3' 
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transcriptional regulatory regions of GENSET genes; (j) the polynucleotides comprising the 
nucleotide sequence of any combination of (g)-(i); (k) the variant polynucleotides of any of the 
polynucleotides of (a)-(j); (1) the polynucleotides comprising a nucleotide sequence of (a)-(k), 
wherein the polynucleotide is single stranded, double stranded, or a portion is single stranded and a 
5 portion is double stranded; (m) the polynucleotides comprising a nucleotide sequence 

complementary to any of the single stranded polynucleotides of (1). The invention further provides 
for fragments of the nucleic acids and polypeptides of (a)-(m) described above. 

Further embodiments of the invention include purified or isolated polynucleotides that 
comprise, consist of, or consist essentially of a nucleotide sequence at least 70% identical, more 

10 preferably at least 75%, and even more preferably at least 80%, 85%, 90%, 95%, 96%, 97%, 98%, 
or 99% identical, to any of the nucleotide sequences in (a)-(m) above, e.g. over a region of 
contiguous nucleotides at least about any one integer between 10 and the last integer representing 
the last integer representing the last nucleotide of a specified sequence of the sequence listing, or a 
polynucleotide which hybridizes under stringent hybridization conditions to a polynucleotide of the 

15 present invention including (a) through (m) above. 

The present invention also relates to recombinant vectors, which include the purified or 
isolated polynucleotides of the present invention, and to host cells recombinant for the 
polynucleotides of the present invention, as well as to methods of making such vectors and host 
cells. The present invention further relates to the use of these recombinant vectors and recombinant 

20 host cells in the production of GENSET polypeptides. The present invention further relates to a 
polynucleotide of the present invention operably linked to a regulatory sequence including 
promoters, enhancers, etc. 

The invention further provides a purified or isolated polypeptide comprising, consisting of, 
or consisting essentially of an amino acid sequence selected from the group consisting of: (a) the 

25. full length polypeptides of even SEQ ID NOs:2-l 12; (b) the full length polypeptides encoded by 
the clone inserts of the deposited clone pool; (c) the epitope-bearing fragments of the polypeptides 
of even SEQ ID NOs:2-l 12; (d) the epitope-bearing fragments of the polypeptides encoded by the 
clone inserts contained in the deposited clone pool; (e) the domains of the polypeptides of even 
SEQ ID NOs:2-l 12; (f) the domains of the polypeptides encoded by the clone inserts contained in 

30 the deposited clone pool; (g) the signal peptides of the polypeptides of even SEQ ID NOs:2-l 12 or 
encoded by the human cDNAs of the deposited clone pool; (h) the mature polypeptides of even 
SEQ ID Nos:2-l 12 or encoded by the human cDNAs of the deposited clone pool; and (i) the allelic 
variant polypeptides of any of the polypeptides of (a)-(h). The invention further provides for 
fragments of the polypeptides of (a)-(i) above, such as those having biological activity or 

3 5 comprising biologically functional domain(s). 

The present invention further includes polypeptides with an amino acid sequence with at 
least 70% similarity, and more preferably at least 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 
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99% similarity to those polypeptides described in (a)-(i), or fragments thereof, as well as 
polypeptides having an amino acid sequence at least 70% identical, more preferably at least 75% 
identical, and still more preferably 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% identical to 
those polypeptides described in (a)-(i), or fragments thereof, e.g. over a region of amino acids at 
5 least any one integer between 6 and the last integer representing the last amino acid of a specified 
polypeptide sequence of the sequence listing. The invention further relates to methods of making 
the polypeptides of the present invention. 

The present invention further relates to transgenic plants or animals, wherein said 
transgenic plant or animal is transgenic for a polynucleotide of the present invention and expresses 
10 a polypeptide of the present invention. 

The invention further relates to antibodies that specifically bind to GENSET polypeptides 
of the present invention and fragments thereof as well as to methods for producing such antibodies 
and fragments thereof. 

The invention also provides kits, uses and methods for detecting GENSET gene expression 
15 and/or biological activity in a biological sample. One such method involves assaying for the 
expression of a GENSET polynucleotide in a biological sample using the polymerase chain 
reaction (PCR) to amplify and detect GENSET polynucleotides or Southern and Northern blot 
hybridization to detect GENSET genomic DNA, cDNA or mRNA. Alternatively, a method of 
detecting GENSET gene expression in a test sample can be accomplished using a compound which 
20 binds to a GENSET polypeptide of the present invention or a portion of a GENSET polypeptide. 
The present invention also relates to diagnostic methods and uses of GENSET 
polynucleotides and polypeptides for identifying individuals or non-human animals having 
elevated or reduced levels of GENSET gene products, which individuals are likely to benefit from 
therapies to suppress or enhance GENSET gene expression, respectively, and to methods of 
25 identifying individuals or non-human animals at increased risk for developing, or at present 
having, certain diseases/disorders associated with GENSET polypeptide expression or biological 
activity. 

The present invention also relates to kits, uses and methods of screening compounds for 
their ability to modulate (e.g. increase or inhibit) the activity or expression of GENSET 
30 polypeptides including compounds that interact with GENSET gene regulatory sequences and 
compounds that interact directly or indirectly with a GENSET polypeptide. Uses of such 
compounds are also within the scope of the present invention. 

The present invention also relates to pharmaceutical or physiologically acceptable 
compositions comprising, an active agent, the polypeptides, polynucleotides or antibodies of the 
35 present invention, as well as, typically, a physiologically acceptable carrier. 

The present invention also relates to computer systems containing cDNA codes and 
polypeptide codes of sequences of the invention and to computer-related methods of comparing 
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sequences, identifying homology or features using GENSET polypeptides or GENSET 
polynucleotide sequences of the invention. 

In another aspect, the present invention provides an isolated polynucleotide, the 
polynucleotide comprising a nucleic acid sequence encoding a polypeptide of the present invention 
5 including the polypeptide of (a) through (i) above . 

In another aspect, the present invention provides a non-human transgenic animal 
comprising the host cell. 

In another aspect, the present invention provides a method of making a GENSET 
polypeptide, the method comprising a) providing a population of host cells comprising a herein- 
1 0 described polynucleotide and b) culturing the population of host cells under conditions conducive 
to the production of the polypeptide within said host cells. 

In one embodiment, the method further comprises purifying the polypeptide from the 
population of host cells. 

In another aspect, the present invention provides a method of making a GENSET 
15 polypeptide, the method comprising a) providing a population of cells comprising a polynucleotide 
encoding a herein-described polypeptide; b) culturing the population of cells under conditions 
conducive to the production of the polypeptide within the cells; and c) purifying the polypeptide 
from the population of cells. 

In another aspect, the present invention provides a biologically active polypeptide encoded 
20 by any of the herein-described polynucleotides. 

In one embodiment, the polypeptide is selectively recognized by an antibody raised against 
an antigenic polypeptide, or an antigenic fragment thereof, the antigenic polypeptide comprising 
any one of the sequences shown as even SEQ ID NOs:2-l 12 or any one of the sequences of 
polypeptides encoded by the human cDNAs of the deposited clone pool. 
25 In another aspect, the present invention provides an antibody that specifically binds to any 

of ther herein-described polypeptides and methods of binding antibody to said polypeptide. 

Li another aspect, the present invention provides a method of determining whether a 
GENSET gene is expressed within a mammal, the method comprising the steps of: a) providing a 
biological sample from said mammal; b) contacting said biological sample with either of: (i) a 
30 polynucleotide that hybridizes under stringent conditions to any of the herein-described 
polynucleotides; or (ii) a polypeptide that specifically binds to any of the herein-described 
polypeptides; and c) detecting the presence or absence of hybridization between the polynucleotide 
and an RNA species within the sample, or the presence or absence of binding of the polypeptide to 
a protein within the sample; wherein a detection of the hybridization or of the binding indicates 
35 that the GENSET gene is expressed within the mammal. 

In one embodiment, the polynucleotide is a primer, and the hybridization is detected by 
detecting the presence of an amplification product comprising the sequence of the primer. In 
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another embodiment, the polypeptide is an antibody. 

In another aspect, the present invention provides a method of determining whether a 
mammal has an elevated or reduced level of GENSET gene expression, the method comprising the 
steps of: a) providing a biological sample from the mammal; and b) comparing the amount of any 
5 of the herein-described polypeptides, or of an RNA species encoding the polypeptide, within the 
biological sample with a level detected in or expected from a control sample; wherein an increased 
amount of the polypeptide or the RNA species within the biological sample compared to the level 
detected in or expected from the control sample indicates that the mammal has an elevated level of 
the GENSET gene expression, and wherein a decreased amount of the polypeptide or the RNA 
10 species within the biological sample compared to the level detected in or expected from the control 
sample indicates that the mammal has a reduced level of the GENSET gene expression. 

In another aspect, the present invention provides a method of identifying a candidate 
modulator of a GENSET polypeptide, the method comprising: a) contacting any of the herein- 
described polypeptides with a test compound; and b) determining whether the compound 
1 5 specifically binds to the polypeptide; wherein a detection that the compound specifically binds to 
the polypeptide indicates or inhibits or activates of a specified biological activity that the 
compound is a candidate modulator of the GENSET polypeptide. 

BRIEF DESCRIPTION OF DRAWINGS 
Figure 1 is a block diagram of an exemplary computer system. 
20 Figure 2 is a flow diagram illustrating one embodiment of a process 200 for comparing a 

new nucleotide or protein sequence with a database of sequences in order to determine the identity 
levels between the new sequence and the sequences in the database. 

Figure 3 is a flow diagram illustrating one embodiment of a process 250 in a computer for 
determining whether two sequences are homologous. 
25 Figure 4 is a flow diagram illustrating one embodiment of an identifier process 300 for 

detecting the presence of a feature in a sequence. 

BRIEF DESCRIPTION OF TABLES 
Table I provides the Applicants* internal designation number (Clone ID_Clone Name) 
which corresponds to each sequence identification number (SEQ ID NO.) of the Sequence Listing, 
30 and indicates whether the sequence is a nucleic acid sequence (DNA) or a polypeptide sequence 
(PRT). Further provided is information regarding the name of the corresponding nucleic acid or 
polypeptide sequence, and information regarding the deposit of biological material. It should be 
appreciated that biological materials have been deposited with reference to their corresponding 
Clone ID, Clone Name, or both Clone ID_Clone Name. 
35 Table II provides the positions of the nucleotides of the corresponding SEQ ID NOs. of 

the Sequence Listing which comprise the open reading frame (ORF), signal peptide, mature 
peptide, polyadenylation signal, and the polyA tail of the polynucleotides of the invention. 
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Table m provides the positions of the amino acid of the corresponding SEQ ID NOs. of 
the Sequence Listing which comprise the positions of immunogenic epitopes of the polypeptides of 
the invention, which are useful in antibody generation as described in Example 1. 

Table IV provides the positions of the nucleotides comprising preferentially included or 
5 excluded fragments of the corresponding SEQ ID NOs. of the Sequence Listing. 

BRIEF DESCRIPTION OF SEQUENCES 
Sequences are presented in the accompanying Sequence Listing. 

Odd SEQ ID NOs: 1-1 1 1 are the nucleotide sequences of cDNAs, with open reading frames 
as indicated. When appropriate, the potential polyadenylation site and polyadenylation signal are 
10 also indicated. 

Even SEQ ID NOs:2-l 12 are the amino acid sequences of proteins encoded by the cDNAs 
of odd SEQ ID NOs:l~l 11. 

In accordance with the regulations relating to Sequence Listings, the following codes have 
been used in the Sequence Listing to describes nucleotide sequences. The code V in the 

15 sequences indicates that the nucleotide may be a guanine or an adenine. The code "y" in the 
sequences indicates that the nucleotide may be a thymine or a cytosine. The code "m" in the 
sequences indicates that the nucleotide may be an adenine or a cytosine. The code "k" in the 
sequences indicates that the nucleotide may be a guanine or a thymine. The code "s" in the 
sequences indicates that the nucleotide may be a guanine or a cytosine. The code "w" in the 

20 sequences indicates that the nucleotide may be an adenine or an thymine. In addition, all instances 
of the symbol "n" in the nucleic acid sequences mean that the nucleotide can be adenine, guanine, 
cytosine or thymine. 

In some instances, the polypeptide sequences in the Sequence Listing contain the symbol 

"Xaa." These M Xaa" symbols indicate either (1) a residue which cannot be identified because of 
25 nucleotide sequence ambiguity or (2) a stop codon in the determined sequence where applicants 

believe one should not exist (if the sequence were determined more accurately). In some instances, 

several possible identities of the unknown amino acids may be suggested by the genetic code. 

In the case of secreted proteins, it should be noted that, in accordance with the regulations 

governing Sequence Listings, in the appended Sequence Listing the encoded protein (i.e. the 
30 protein containing the signal peptide and the mature protein or fragment thereof) extends from an 

amino acid residue having a negative number through a positively numbered amino acid residue. 

Thus, the first amino acid of the mature protein resulting from cleavage of the signal peptide is 

designated as amino acid number 1, and the first amino acid of the signal peptide is designated 

with the appropriate negative number. 
35 In the case that a polynucleotide or polypeptide sequence described in the specification for 

SEQ ID NOs: 1-1 12 is in conflict with the corresponding sequence provided in the Sequence 

listing, the sequences provided in the Sequence listing controls. 
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IT SHOULD BE APPRECIATED THE THE POLYNUCLEOTIDE AND POLYPEPTIDE 
SEQUENCES OF SEP ID NO:l-112 OF THE SEQUENCE LISTING ARE HEREBY 
INCORPORATED BY REFERENCE IN THEIR ENTIRETIES. 

DETAILED DESCRIPTION OF THE INVENTION AND PREFERRED EMBODIMENTS 

5 Definitions. 

Before describing the invention in greater detail, the following definitions are set forth to 
illustrate and define the meaning and scope of the terms used to describe the invention herein. 

The term "GENSET gene/ 5 when used herein, encompasses genomic, mRNA and cDNA 
sequences encoding a GENSET polypeptide, including the 5' and 3 5 untranslated regions of said 
10 sequences. 

The term "GENSET polypeptide biological activity" or "GENSET biological activity" is 
intended for polypeptides exhibiting any activity similar, but not necessarily identical, to an 
activity of a GENSET polypeptide of the invention. The GENSET polypeptide biological activity 
of a given polypeptide may be assessed using any suitable biological assay, a number of which are 
15 known to those skilled in the art. In contrast, the term "biological activity" refers to any activity 
that any polypeptide may have. 

The term "corresponding mRNA" refers to mRNA which was or can be a template for 
cDNA synthesis for producing a cDNA of the present invention. 

The term "corresponding genomic DNA" refers to genomic DNA which encodes an 
20 mRNA of interest, e.g. corresponding to a cDNA of the invention, which genomic DNA includes 
the sequence of one of the strands of the mRNA, in which thymidine residues in the sequence of 
the genomic DNA (or cDNA) are replaced by uracil residues in the mRNA. 

The term "deposited clone pool" is used herein to refer to the pool of clones entitled 
cDNA-1 1-2000 deposited with the ATCC on November 27, 2000, or cDNA-8-2000, deposited 
25 with the ATCC on September 15, 2000. 

The term "heterologous", when used herein, is intended to designate any polynucleotide or 
polypeptide other than a GENSET polynucleotide or GENSET polypeptide of the invention, 
respectively. 

"Providing" with respect to, e.g. a biological sample, population of cells, etc. indicates that 
30 the sample, population of cells, etc. is somehow used in a method or procedure. Significantly, 
"providing" a biological sample or population of cells does not require that the sample or cells are 
specifically isolated or obtained for the purposes of the invention, but can instead refer, for 
example, to the use of a biological sample obtained by another individual, for another purpose. 

An "amplification producf ' refers to a product of any amplification reaction, e.g. PCR, RT- 
35 PCR,LCR,etc. 

A "modulator" of a protein or other compound refers to any agent that has a functional 
effect on the protein, including physical binding to the protein, alterations of the quantity or quality 
of expression of the protein, altering any measurable or detectable activity, property, or behavior of 
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the protein, or in any way interacts with the protein or compound. 

"A test compound" can be any molecule that is evaluated for its ability to modulate a 
protein or other compound. 

An antibody or other compound that specifically binds to a polypeptide or polynucleotide 
5 of the invention is also said to "selectively recognize" the polypeptide or polynucleotide. 

The term "isolated" with respect to a molecule requires that the molecule be removed from 
its original environment (e. g., the natural environment if it is naturally occurring). For example, a 
naturally-occurring polynucleotide or polypeptide present in a living animal is not isolated, but the 
same polynucleotide or DNA or polypeptide, separated from some or all of the coexisting materials 

10 in the natural system, is isolated. Such polynucleotide could be part of a vector and/or such 

polynucleotide or polypeptide could be part of a composition, and still be isolated in that the vector 
or composition is not part of its natural environment. For example, a naturally-occurring 
polynucleotide present in a living animal is not isolated, but the same polynucleotide, separated 
from some or all of the coexisting materials in the natural system, is isolated. Specifically 

15 excluded from the definition of "isolated" are: naturally-occurring chromosomes (such as 

chromosome spreads), artificial chromosome libraries, genomic libraries, and cDNA libraries that 
exist either as an in vitro nucleic acid preparation or as a transfected/transformed host cell 
preparation, wherein the host cells are either an in vitro heterogeneous preparation or plated as a 
heterogeneous population of single colonies. Also specifically excluded are the above libraries 

20 wherein a specified polynucleotide makes up less than 5% (may also be specified as 10%, 25%, 
50%, or 75%) of the number of nucleic acid inserts in the vector molecules. Further specifically 
excluded are whole cell genomic DNA or whole cell RNA preparations (including said whole cell 
preparations which are mechanically sheared or enzymatically digested). Further specifically 
excluded are the above whole cell preparations as either an in vitro preparation or as a 

25 heterogeneous mixture separated by electrophoresis (including blot transfers of the same) wherein 
the polynucleotide of the invention has not further been separated from the heterologous 
polynucleotides in the electrophoresis medium (e.g., further separating by excising a single band 
from a heterogeneous band population in an agarose gel or nylon blot). 

The term "purified" does not require absolute purity; rather, it is intended as a relative 

30 definition. Purification of starting material or natural material to at least one order of magnitude, 
preferably two or three orders, and more preferably four or five orders of magnitude is expressly 
contemplated. 

The term "purified" is further used herein to describe a polypeptide or polynucleotide of 
the invention which has been separated from other compounds including, but not limited to, 
35 polypeptides or polynucleotides, carbohydrates, lipids, etc. The term "purified" may be used to 
specify the separation of monomelic polypeptides of the invention from oligomeric forms such as 
homo- or hetero- dimers, trimers, etc. The term "purified" may also be used to specify the 
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separation of covalently closed (i.e. circular) polynucleotides from linear polynucleotides. A 
substantially pure polypeptide or polynucleotide typically comprises about 50%, preferably 60 to 
90% weight/weight of a polypeptide or polynucleotide sample, respectively, more usually about 
95%, and preferably is over about 99% pure but, may be specificed as any integer of percent 

5 between 50 and 100. Polypeptide and polynucleotide purity, or homogeneity, is indicated by a 
number of means well known in the art, such as agarose or polyacrylamide gel electrophoresis of a 
sample, followed by visualizing a single band upon staining the gel. For certain purposes higher 
resolution can be provided by using HPLC or other means well known in the art. As an alternative 
embodiment, purification of the polypeptides and polynucleotides of the present invention may be 

10 expressed as "at least? 3 a percent purity relative to heterologous polypeptides and polynucleotides 
(DNA, RNA or both). As a preferred embodiment, the polypeptides and polynucleotides of the 
present invention are at least; 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, 95%, 96%, 96%, 
98%, 99%, or 100% pure relative to heterologous polypeptides and polynucleotides, respectively. 
As a further preferred embodiment the polypeptides and polynucleotides have a purity ranging 

1 5 from any number, to the thousandth position, between 90% and 100% (e.g., a polypeptide or 

polynucleotide at least 99.995% pure) relative to either heterologous polypeptides or polynucleotides, 
respectively, or as a weight/weight ratio relative to all compounds and molecules other than those 
existing in the carrier. Each number representing a percent purity, to the thousandth position, may 
be claimed as individual species of purity. 

20 As used interchangeably herein, the terms " nucleic acid moleculefsY '. "oligonucleotide^) " . 

and " polynucleotide^) " include RNA or DNA (either single or double stranded, coding, 
complementary or antisense), or RNA/DNA hybrid sequences of more than one nucleotide in 
either single chain or duplex form (although each of the above species may be particularly 
specified). The term " nucleotide" is used herein as an adjective to describe molecules comprising 

25 RNA, DNA, or RNA/DNA hybrid sequences of any length in single-stranded or duplex form. 
More precisely, the expression "nucleotide sequence" encompasses the nucleic material itself and 
is thus not restricted to the sequence information (i.e. the succession of letters chosen among the 
four base letters) that biochemically characterizes a specific DNA or RNA molecule. The term 
"nucleotide" is also used herein as a noun to refer to individual nucleotides or varieties of 

30 nucleotides, meaning a molecule, or individual unit in a larger nucleic acid molecule, comprising a 
purine or pyrimidine, a ribose or deoxyribose sugar moiety, and a phosphate group, or 
phosphodiester linkage in the case of nucleotides within an oligonucleotide or polynucleotide. The 
term "nucleotide" is also used herein to encompass "modified nucleotides" which comprise at least 
one modification such as (a) an alternative linking group, (b) an analogous form of purine, (c) an 

35 analogous form of pyrimidine, or (d) an analogous sugar. For examples of analogous linking 
groups, purine, pyrimidines, and sugars, see, for example, PCT publication No. WO 95/04064, 
which disclosure is hereby incorporated by reference in its entirety. Preferred modifications of the 



10 



WO 02/094864 



PCT/IB01/01715 



present invention include, but are not limited to, 5-fluorouracil, 5-bromouracil, 5-chlorouracil, 
5-iodouracil, hypoxanthine, xantine, 4-acetylcytosine, 5-(carboxyhydroxylmethyl) uracil, 
5-carboxymethylaminomethyl-2-thiouridine, 5-carboxymethylaminomethyluracil, dihydrouracil, 
beta-D-galactosylqueosine, inosine, N6-isopentenyladenine, 1-methylguanine, 1-methylinosine, 
5 2,2-dimethylguanine, 2-methyladenine, 2-methylguanine, 3-methylcytosine, 5-methylcytosine, 
N6-adenine, 7-methylguanine, 5-methylaminomethyluracil, 5-methoxyaminoniethyl-2-thiouracil, 
beta-D-mannosylqueosine, 5-methoxycarboxymethyluracil, 5-methoxyuracil, 2-methylthio-N6- 
isopentenyladenine, uracil-5-oxyacetic acid (v) ybutoxosine, pseudouracil, queosine, 
2-thiocytosine, 5-methyl-2-thiouraciI, 2-thiouraciI, 4-thiouracil, 5-methyluracil, uracil-5-oxyacetic 

10 acid methylester, uracil-5-oxyacetic acid, 5-methyl-2-thiouracil, 3-(3-amino-3-N-2-carboxypropyl) 
uracil, and 2,6-diaminopurine. The polynucleotide sequences of the invention may be prepared by 
any known method, including synthetic, recombinant, ex vivo generation, or a combination thereof, 
as well as utilizing any purification methods known in the art. Methylenemethylimino linked 
oligonucleosides as well as mixed backbone compounds having, may be prepared as described in 

15 U.S. Pat. Nos. 5,378,825; 5,386,023; 5,489,677; 5,602,240; and 5,610,289, which disclosures are 
hereby incorporated by reference in their entireties. Formacetal and thioformacetal linked 
oligonucleosides may be prepared as described in U.S. Pat. Nos. 5,264,562 and 5,264,564, which 
disclosures are hereby incorporated by reference in their entireties. Ethylene oxide linked 
oligonucleosides may be prepared as described in U.S. Pat. No. 5,223,61 8, which disclosure is 

20 hereby incorporated by reference in its entirety. Phosphinate oligonucleotides may be prepared as 
described in U.S. Pat. No. 5,508,270, which disclosure is hereby incorporated by reference in its 
entirety. Alkyl phosphonate oligonucleotides may be prepared as described in U.S. Pat. No. 
4,469,863, which disclosure is hereby incorporated by reference in its entirety. 3'-Deoxy-3 f - 
methylene phosphonate oligonucleotides may be prepared as described in U.S. Pat. Nos. 5,610,289 

25 or 5,625,050 which disclosures are hereby incorporated by reference in their entireties. 

Phosphoramidite oligonucleotides may be prepared as described in U.S. Pat. No. 5,256,775 or U.S. 
Pat. No. 5,366,878 which disclosures are hereby incorporated by reference in their entireties. 
Alkylphosphonothioate oligonucleotides may be prepared as described in published PCT 
applications WO 94/17093 and WO 94/02499 which disclosures are hereby incorporated by 

30 reference in their entireties. 3-Deoxy-3 -amino phosphoramidate oligonucleotides may be 
prepared as described in U.S. Pat. No. 5,476,925, which disclosure is hereby incorporated by 
reference in its entirety. Phosphotriester oligonucleotides may be prepared as described in U.S. 
Pat. No. 5,023,243, which disclosure is hereby incorporated by reference in its entirety. Borano 
phosphate oligonucleotides may be prepared as described in U.S. Pat Nos. 5,130,302 and 

35 5,177,198 which disclosures are hereby incorporated by reference in their entireties. 

The term "upstream" is used herein to refer to a location which is toward the 5* end of the 
polynucleotide from a specific reference point. 
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The terms "base paired" and "Watson & Crick base paired" are used interchangeably 
herein to refer to nucleotides which can be hydrogen bonded to one another by virtue of then- 
sequence identities in a manner like that found in double-helical DNA with thymine or uracil 
residues linked to adenine residues by two hydrogen bonds and cytosine and guanine residues 

5 linked by three hydrogen bonds (see Stryer, (1995) Biochemistry, 4th edition, which disclosure is 
hereby incorporated by reference in its entirety). 

The terms "complementary" or "complement thereof are used herein to refer to the 
sequences of polynucleotides which is capable of forming Watson & Crick base pairing with 
another specified polynucleotide throughout the entirety of the complementary region. For the 

10 purpose of the present invention, a first polynucleotide is deemed to be complementary to a second 
polynucleotide when each base in the first polynucleotide is paired with its complementary base. 
Complementary bases are, generally, A and T (or A and U), or C and G. "Complement" is used 
herein as a synonym from "complementary polynucleotide", "complementary nucleic acid" and 
"complementary nucleotide sequence". These terms are applied to pairs of polynucleotides based 

15 solely upon their sequences and not any particular set of conditions under which the two 

polynucleotides would actually bind. Unless otherwise stated, all complementary polynucleotides 
are fully complementary on the whole length of the considered polynucleotide. 

The terms "polypeptide" and "protein", used interchangeably herein, refer to a polymer of 
amino acids without regard to the length of the polymer; thus, peptides, oligopeptides, and proteins 

20 are included within the definition of polypeptide. This term also does not specify or exclude 

chemical or post-expression modifications of the polypeptides of the invention, although chemical 
or post-expression modifications of these polypeptides may be included or excluded as specific 
embodiments. Therefore, for example, modifications to polypeptides that include the covalent 
attachment of glycosyl groups, acetyl groups, phosphate groups, lipid groups and the like are 

25 expressly encompassed by the term polypeptide. Further, polypeptides with these modifications 
may be specified as individual species to be included or excluded from the present invention. The 
natural or other chemical modifications, such as those listed in examples above can occur 
anywhere in a polypeptide, including the peptide backbone, the amino acid side-chains and the 
amino or carboxyl termini. It will be appreciated that the same type of modification may be 

30 present in the same or varying degrees at several sites in a given polypeptide. Also, a given 
polypeptide may contain many types of modifications. Polypeptides may be branched, for 
example, as a result of ubiquitination, and they may be cyclic, with or without branching. 
Modifications include acetylation, acylation, ADP-ribosylation, amidation, covalent attachment of 
flavin, covalent attachment of a heme moiety, covalent attachment of a nucleotide or nucleotide 

35 derivative, covalent attachment of a lipid or lipid derivative, covalent attachment of 
phosphotidylinositol, cross-linking, cyclization, disulfide bond formation, demethylation, 
formation of covalent cross-links, formation of cysteine, formation of pyroglutamate, formylation, 
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gamma-carboxylation, glycosylation, GPI anchor formation, hydroxylation, iodination, 
methylation, myristoylation, oxidation, pegylation, proteolytic processing, phosphorylation, 
prenylation, racemization, selenoylation, sulfation, transfer-RNA mediated addition of amino acids 
to proteins such as arginylation, and ubiquitination. [See, for instance Creighton, (1993), 
5 Posttranslational Covalent Modification of Proteins, W.H. Freeman and Company, New York B.C. 
Johnson, Ed., Academic Press, New York 1-12; Seifter, et aL, (1990) Meth Enzymol 182:626-646; 
Rattan et al, (1992) Ann NY Acad Sci 663:48-62]. Also included within the definition are 
polypeptides which contain one or more analogs of an amino acid (including, for example, non- 
naturally occurring amino acids, amino acids which only occur naturally in an unrelated biological 
10 system, modified amino acids from mammalian systems, etc.), polypeptides with substituted 
linkages, as well as other modifications known in the art, both naturally occurring and non- 
naturally occurring. 

As used herein, the terms "recombinant polynucleotide" and "polynucleotide construct" are 
used interchangeably to refer to linear or circular, purified or isolated polynucleotides that have 

1 5 been artificially designed and which comprise at least two nucleotide sequences that are not found 
as contiguous nucleotide sequences in their initial natural environment. In particular, these terms 
mean that the polynucleotide or cDNA is adjacent to "backbone" nucleic acid to which it is not 
adjacent in its natural environment. Additionally, to be "enriched" the cDNAs will represent 5% or 
more of the number of nucleic acid inserts in a population of nucleic acid backbone molecules. 

20 Backbone molecules according to the present invention include nucleic acids such as expression 
vectors, self-replicating nucleic acids, viruses, integrating nucleic acids, and other vectors or 
nucleic acids used to maintain or manipulate a nucleic acid insert of interest Preferably, the 
enriched cDNAs represent 15% or more of the number of nucleic acid inserts in the population of 
recombinant backbone molecules. More preferably, the enriched cDNAs represent 50% or more of 

25 the number of nucleic acid inserts in the population of recombinant backbone molecules. In a 
highly preferred embodiment, the enriched cDNAs represent 90% or more (including any number 
between 90 and 100%, to the thousandth position, e.g., 99.5%) of the number of nucleic acid 
inserts in the population of recombinant backbone molecules. 

The term "recombinant polypeptide" is used herein to refer to polypeptides that have been 

30 artificially designed and which comprise at least two polypeptide sequences that are not found as 
contiguous polypeptide sequences in their initial natural environment, or to refer to polypeptides 
which have been expressed from a recombinant polynucleotide. 

As used herein, the term "operably linked" refers to a linkage of polynucleotide elements 
in a functional relationship. A sequence which is "operably linked" to a regulatory sequence such 

35 as a promoter means that said regulatory element is in the correct location and orientation in 

relation to the nucleic acid to control RNA polymerase initiation and expression of the nucleic acid 
of interest. For instance, a promoter or enhancer is operably linked to a coding sequence if it 
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affects the transcription of the coding sequence. 

The term "domain" refers to an amino acid fragment with specific biological properties. 
This term encompasses all known structural and linear biological motifs. Examples of such motifs 
include but are not limited to leucine zippers, helix-turn-helix motifs, glycosylation sites, 
5 ubiquitination sites, alpha helices, and beta sheets, signal peptides which direct the secretion of 
proteins, sites for post-translational modification, enzymatic active sites, substrate binding sites, 
and enzymatic cleavage sites. 

Although each of these terms has a distinct meaning, the terms "comprising", "consisting 
of 1 and "consisting essentially of 1 may be interchanged for one another throughout the instant 
10 application. The term "having" has the same meaning as "comprising" and may be replaced with 
either the term "consisting of 1 or "consisting essentially of. 

Unless otherwise specified in the application, nucleotides and amino acids of 
polynucleotides and polypeptides, respectively, of the present invention are contiguous and not 
interrupted by heterologous sequences. 
1 5 The term "neoplastic cells" as used herein refers to cells that result from abnormal new 

growth. A neoplastic cell further includes transformed cells, cancer cells including blood cancers 
and solid tumors (benign and malignant). 

As used herein, the term "tumor" refers to an abnormal mass or population of cells that 
result from excessive cell division, whether malignant or benign, and all precancerous and 
20 cancerous cells and tissues. A "tumor" is further defined as two or more neoplastic cells. 

"Malignant tumors" are distinguished from benign growths or tumors in that, in addition to 
uncontrolled cellular proliferation, they will invade surrounding tissues and may additionally 
metastasize. 

The term "transformed cells," "malignant cells" or "cancer" are interchangeable and refer 
25 to cells that have undergone malignant transformation, but may also include lymphocyte cells that 
have undergone blast transformation. Malignant transformation is a conversion of normal cells to 
malignant cells. Transformed cells have a greater ability to cause tumors when injected into 
animals. Transformation can be recognized by changes in growth characteristics, particularly in 
requirements for macromolecular growth factors, and often also by changes in morphology. 
30 Transformed cells usually proliferate without requiring adhesion to a substratum and usually lack 
cell to cell inhibition and pile up after forming a monolayer in cell culture. 

The term "neoplastic disease" as used herein refers to a condition characterized by 
uncontrolled, abnormal growth of cells. Neoplastic diseases include cancer. Examples of cancer 
include but are not limited to, carcinoma, lymphoma, blastoma, sarcoma, and leukemia. More 
35 particular examples of such cancers include breast cancer, prostate cancer, colon cancer, squamous 
cell cancer, small-cell lung cancer, non-small cell lung cancer, ovarian cancer, cervical cancer, 
gastrointestinal cancer, pancreatic cancer, glioblastoma, liver cancer, bladder cancer, hepatoma, 
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colorectal cancer, uterine cervical cancer, endometrial carcinoma, salivary gland carcinoma, 
kidney cancer, vulval cancer, thyroid cancer, hepatic carcinoma, skin cancer, melanoma, brain 
cancer, ovarian cancer, neuroblastoma, myeloma, various types of head and neck cancer, acute 
lymphoblastic leukemia, acute myeloid leukemia, Ewing sarcoma and peripheral 

5 neuroepithelioma. All of the possible cancers listed herein are included in, or may be excluded 
from, the present invention as individual species. 

As used herein, the term "carcinoma" refers to a new growth that arises from epithelium, 
found in skin or, more commonly, the lining of body organs (adenocarcinoma), for example: 
breast, prostate, lung, stomach or bowel. Carcinomas include bladder carcinoma, 

10 hepatocarcinoma, hepatoblastoma, rhabdomyosarcoma, ovarian carcinoma, cervical carcinoma, 
lung carcinoma, breast carcinoma, colorectal carcinoma, uterine cervical cancer carcinoma, 
endometrioid carcinoma, paraganglioma, squamous cell carcinoma in head and neck, esophageal 
carcinoma, thyroid carcinoma, astrocytoma, neuroblastoma and neuroepithelioma. All of the 
possible carcinomas listed herein are included in, or may be excluded from, the present invention 

15 as individual species. 

The term "immortalized cells" as used herein refers to cells reproduce indefinitely. The 
cells escape from the normal limitation on growth of a finite number of division cycles. The term 
does not include malignant cells. 

The term "normal cells" as used herein refers to cells that have a limitation on growth, i.e. 

20 a finite number of division cycles (the Hayflick limit); therefore, is a nontumorigenic cell. Normal 
cell include primary cells, which is a cell or cell line taken directly from a living organism which is 
not immortalized. 

The term "cell cycle" as used herein refers to the cyclic biochemical and structural events 
occurring during growth and division of cells. The stages of the cell cycle include G 0 (Gap 0; rest 
25 phase), Gl (Gap 1), S phase (DNA synthesis), G2 (Gap 2) and M phase (mitosis). 

The term "cell growth" as used herein refers to an increase in the size of a population of 

cells. 

The term "cell division" as used herein refers to mitosis, i.e., the process of cell 
reproduction. 

30 The term "proliferation" as used herein means growth and division of cells. "Actively 

proliferating" means cells that are actively growing and dividing. 

The term "inhibiting cellular proliferation" as used herein refers to slowing and/or 
preventing the growth and division of cells. Cells may further be specified as being arrested in a 
particular cell cycle stage: Gl (Gap 1), S phase (DNA synthesis), G2 (Gap 2) or M phase (mitosis). 
35 The term "preferentially inhibiting cellular proliferation" as used herein refers to slowing 

and/or preventing the growth and division of cells as compared to normal cells. 

The term "metastasis" refers to the transfer of disease (e.g., cancer) from one organ and/or 
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tissue to another not directly connected with it. As used herein, metastasis refers to neoplastic cell 
growth in an unregulated fashion and spread to distal tissues and organs of the body. 

The term "inhibiting metastasis" refers to slowing and/or preventing metastasis or the 
spread of neoplastic cells to a site remote from the primary growth area. 
5 The term "invasion" as used herein refers to the spread of cancerous cells to surrounding 

tissues. 

The term "inhibiting invasion" refers to slowing and/or preventing the spread of cancerous 
cells to surrounding tissues. 

The term "apoptosis" as used herein refers to programmed cell death as signaled by the 

10 nuclei in normally functioning human and animal cells when age or state of cell health and 

condition dictates. "Apoptosis" is an active process requiring metabolic activity by the dying cell, 
often characterized by cleavage of the DNA into fragments that give a so called laddering pattern 
on gels. Cells that die by apoptosis do not usually elicit the inflammatory responses that are 
associated with necrosis, though the reasons are not clear. Cancerous cells, however, are unable to 

15 experience, or have a reduction in, the normal cell transduction or apoptosis-driven natural cell 
death process. Morphologically, apoptosis is characterized by loss of contact with neighboring 
cells, concentration of cytoplasm, endonuclease activity-associated chromatin condensation and 
pyknosis, and segmentation of the nucleus, among others. 

The term "necrosis" as used herein refers to the sum of the morphological changes 

20 indicative of cell death and caused by the progressive degradative action of enzymes, it may affect 
groups of cells or part of a structure or an organ. Morphologically, necrosis is characterized by 
marked swelling of mitochondria, swelling of cytoplasm and nuclear alteration, followed by cell 
destruction and autolysis. It occurs passively or incidentally. 

The term "inducing apoptosis" refers to increasing the number of cells that undergo 

25 apoptosis, or the rate by which cells undergo apoptosis, in a given cell population. Preferably the 
increase is at least 1.25, 1.5, 2, 5, 10, 50, 100, 500 or 1000 fold increase as compared to normal, 
untreated or negative control cells. 

The term "inhibiting apoptosis" refers to any decrease in the number of cells which 
undergo apoptosis relative to an untreated control. Preferably, the decrease is at least 1.25, 1.5, 2, 

30 5, 10, 50, 100, 500 or 1000 fold decrease as compared to normal, untreated or negative control 
cells. 

An "effective amount" of a composition disclosed herein or an agonist thereof, in reference 
to "inhibiting the cellular proliferation" of a neoplastic cell, is an amount capable of inhibiting, to 
some extent, the growth of target cells. The term further includes an amount capable of invoking a 
35 growth inhibitory, cytostatic and/or cytotoxic effect and/or apoptosis and/or necrosis of the target 
cells. An "effective amount" of a polypeptide of the present invention or an agonist thereof for 
purposes of inhibiting neoplastic cell growth may be determined empirically and in a routine 
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manner using methods well known in the art. 

A "therapeutically effective amount", in reference to the treatment of neoplastic disease or 
neoplastic cells, refers to an amount capable of invoking one or more of the following effects: 
(1) inhibition, to some extent, of tumor growth, including, (i) slowing down and (ii) complete 
5 growth arrest; (2) reduction in the number of tumor cells; (3) maintaining tumor size; (4) reduction 
in tumor size; (5) inhibition, including (i) reduction, (ii) slowing down or (iii) complete prevention, 
of tumor cell infiltration into peripheral organs; (6) inhibition, including (i) reduction, (ii) slowing 
down or (iii) complete prevention, of metastasis; (7) enhancement of anti-tumor immune response, 
which may result in (i) maintaining tumor size, (ii) reducing tumor size, (iii) slowing the growth of 
10 a tumor, (iv) reducing, slowing or preventing invasion or (v) reducing, slowing or preventing 

metastasis; and/or (8) relief, to some extent, of one or more symptoms associated with the disorder. 
A "therapeutically effective amount" of a polypeptide of the present invention or an agonist thereof 
for purposes of treatment of tumor may be determined empirically and in a routine manner. 

A "growth inhibitory amount" of a Polypeptide of the present invention or an agonist 
15 thereof is an amount capable of inhibiting the growth of a cell, especially a malignant tumor cell, 
e.g., cancer cell, either in vitro or in vivo. A "growth inhibitory amount" of a polypeptide of the 
present invention or an agonist thereof for purposes of inhibiting neoplastic cell growth may be 
determined empirically and in a routine manner using methods well known in the art. 

A "cytotoxic amount" of a polypeptide of the present invention or an agonist thereof is an 
20 amount capable of causing the destruction of a cell, especially tumor, e.g., cancer cell, either in 
vitro or in vivo. A "cytotoxic amount" of a polypeptide of the present invention or an agonist 
thereof for purposes of inhibiting neoplastic cell growth may be determined empirically and in a 
routine manner using methods well known in the art. 

The terms "killing" or "inducing cytotoxicity" as used herein refer to inducing cell death 
25 by either apoptosis and/or necrosis, whereby embodiments of the invention include only apoptosis, 
only necrosis and both apoptosis and necrosis. 

The term "cytotoxic agent" as used herein refers to a substance that inhibits or prevents the 
function of cells, for example by inhibiting progression of the cell cycle, and/or causes cell death. 
The term is intended to include radioactive isotopes, chemotherapeutic agents, and toxins such as 
30 enzymatically active toxins of bacterial, fungal, plant or animal origin, or fragments thereof. 

The term "preventing" as used herein refers to administering a compound prior to the onset 
of clinical symptoms of a disease or condition so as to prevent a physical manifestation of the 
disease or condition. Alternatively, the term "preventing" can also be used to signify the 
reduction, or severity, of clinical symptoms associated with a disease or condition. 
35 "Suppression" involves administration of drug prior to the clinical appearance of disease. 

The term ''treating" as used herein refers to administering a compound after the onset of 
clinical symptoms. 



17 



WO 02/094864 



PCT/IB01/01715 



In human and veterinary medicine, we use the term "prophylaxis" as distinct from 
"treatment" to encompass "preventing" and "suppressing". Herein, "protection" includes 
"prophylaxis". Protection need not be absolute to be useful. 

The term "in need of treatment" as used herein refers to a judgment made by a caregiver 
5 (e.g. physician, nurse, nurse practitioner, etc in the case of humans; veterinarian in the case of 
animals, including non-human mammals) that an individual or animal requires or will benefit from 
treatment. This judgment is made based on a variety of factors that are in the realm of a 
caregiver's expertise, but that include the knowledge that the individual or animal is ill, or will be 
ill, as the result of a condition that is treatable by the compounds of the invention. 
10 The term "perceives a need for treatment 5 ' refers to a sub-clinical determination that an 

individual desires treatment. The term "perceives a need for treatment" in other embodiments can 
refer to the decision that an owner of an animal makes for treatment of the animal. 

The term "individual" or "patient" as used herein refers to any animal, including mammals, 
preferably mice, rats, other rodents, rabbits, dogs, cats, swine, cattle, sheep, horses, or primates, 
15 and most preferably humans. The term may specify male or female or both, or exclude male or 
female. 

As used herein, the term "non-human animal" refers to any non-human animal, including 
insects, birds, rodents and more usually mammals. Preferred non-human animals include: 
primates; farm animals such as swine, goats, sheep, donkeys, cattle, horses, chickens, rabbits; and 
20 rodents, preferably rats or mice. As used herein, the term "animal" is used to refer to any species 
in the animal kingdom, preferably vertebrates, including birds and fish, and more preferable a 
mammal. Both the terms "animal" and "mammal" expressly embrace human subjects unless 
preceded with the term "non-human". 

As used herein, the terms "physiologically acceptable," "phannaceutically acceptable," and 

25 "pharmaceutical" are interchangeable. 

Identity Between Nucleic Acids Or Polypeptides 

The terms "percentage of sequence identity" and "percentage homology" are used 
interchangeably herein to refer to comparisons among polynucleotides and polypeptides, and are 
determined by comparing two optimally aligned sequences over a comparison window, wherein 

30 the portion of the polynucleotide or polypeptide sequence in the comparison window may 

comprise additions or deletions (i.e., gaps) as compared to the reference sequence (which does not 
comprise additions or deletions) for optimal alignment of the two sequences. The percentage is 
calculated by determining the number of positions at which the identical nucleic acid base or 
amino acid residue occurs in both sequences to yield the number of matched positions, dividing the 

35 number of matched positions by the total number of positions in the window of comparison and 
multiplying the result by 100 to yield the percentage of sequence identity. Identity is evaluated 
using any of the variety of sequence comparison algorithms and programs known in the art. Such 
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algorithms and programs include, but are by no means limited to, TBLASTN, BLASTP, FASTA, 
TFASTA, CLUSTALW, FASTDB [Pearson and Lipman, (1988), Proc. Natl. Acad. Sci. USA 
85(8):2444-2448; Altschul et al, (1990), J. Mol. Biol. 215(3):403-410; Thompson et al (1994), 
Nucleic Acids Res. 22(2):4673-4680; Higgins et al, (1996), Meth. Enzymol. 266:383-402; 
5 Altschul et al, (1 993), Nature Genetics 3:266-272; Brutlag et al (1990) Comp. App. Biosci. 
6:237-24], the disclosures of which are incorporated by reference in their entireties. 

In a particularly preferred embodiment, protein and nucleic acid sequence identities are 
evaluated using the Basic Local Alignment Search Tool ("BLAST") which is well known in the art 
[e.g., Karlin and Altschul, (1990), Proc. Natl. Acad. Sci. USA 87:2267-2268; Altschul et al, 
10 (1997), Nuc. Acids Res. 25:3389-3402] the disclosures of which are incorporated by reference in 
their entireties. In particular, five specific BLAST programs are used to perform the following 
task: 

(1) LASTP and BLAST3 compare an amino acid query sequence against a protein 
sequence database; 

15 (2) BLASTN compares a nucleotide query sequence against a nucleotide sequence 

database; 

(3) LASIX compares the six-frame conceptual translation products of a query 
nucleotide sequence (both strands) against a protein sequence database; 

(4) BLASTN compares a query protein sequence against a nucleotide sequence 
20 database translated in all six reading frames (both strands); and 

(5) BLASTX compares the six-frame translations of a nucleotide query sequence 
against the six-frame translations of a nucleotide sequence database. 

The BLAST programs identify homologous sequences by identifying similar segments, 
which are referred to herein as "high-scoring segment pairs," between a query amino or nucleic 

25 acid sequence and a test sequence which is preferably obtained from a protein or nucleic acid 

sequence database. High-scoring segment pairs are preferably identified (i.e., aligned) by means of 
a scoring matrix, many of which are known in the art. Preferably, the scoring matrix used is the 
BLOSUM62 matrix [Gonnet et al, (1992), Science 256:1443-1445; Henikoff and Henikoff, 
(1993), Proteins 17:49-61, the disclosures of which are incorporated by reference in their 

30 entireties]. Less preferably, the PAM or PAM250 matrices may also be used [see, e.g., Schwartz 
and Dayhoff, (1978), eds., Matrices for Detecting Distance Relationships: Atlas of Protein 
Sequence and Structure, Washington: National Biomedical Research Foundation, the disclosure of 
which is incorporated by reference in its entirety]. The BLAST programs evaluate the statistical 
significance of all high-scoring segment pairs identified, and preferably selects those segments 

35 which satisfy a user-specified threshold of significance, such as a user-specified percent homology. 
Preferably, the statistical significance of a high-scoring segment pair is evaluated using the 
statistical significance formula of Karlin (see, e.g., Karlin and Altschul, 1990), the disclosure of 
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which is incorporated by reference in its entirety. The BLAST programs may be used with the 
default parameters or with modified parameters provided by the user. 

Another preferred method for determining the best overall match between a query 
nucleotide sequence (a sequence of the present invention) and a subject sequence, also referred to 
5 as a global sequence alignment, can be determined using the FASTDB computer program based on 
the algorithm of Brutlag et al. (1990), the disclosure of which is incorporated by reference in its 
entirety. In a sequence alignment the query and subject sequences are both DNA sequences. An 
RNA sequence can be compared by first converting U's to Ts. The result of said global sequence 
alignment is in percent identity. Preferred parameters used in a FASTDB alignment of DNA 

10 sequences to calculate percent identity are: Matrix = Unitary, k-tuple = 4, Mismatch Penalty = 1, 
Joining Penalty = 30, Randomization Group Length = 0, Cutoff Score = 1, Gap Penalty = 5, Gap 
Size Penalty = 0.05, Window Size = 500 or the length of the subject nucleotide sequence, 
whichever is shorter. If the subject sequence is shorter than the query sequence because of 5 1 or 3' 
deletions, not because of internal deletions, a manual correction must be made to the results. This 

15 is because the FASTDB program does not account for 5* and 3* truncations of the subject sequence 
when calculating percent identity. For subject sequences truncated at the 5 1 or 3'ends, relative to 
the query sequence, the percent identity is corrected by calculating the number of bases of the 
query sequence that are 5' and 3' of the subject sequence, which are not matched/aligned, as a 
percent of the total bases of the query sequence. Whether a nucleotide is matched/aligned is 

20 determined by results of the FASTDB sequence alignment. This percentage is then subtracted 
from the percent identity, calculated by the above FASTDB program using 10, the specified 
parameters, to arrive at a final percent identity score. This corrected score is what is used for the 
purposes of the present invention. Only nucleotides outside the 5* and 3' nucleotides of the subject 
sequence, as displayed by the FASTDB alignment, which are not matched/aligned with the query 

25 sequence, are calculated for the purposes of manually adjusting the percent identity score. For 
example, a 90 nucleotide subject sequence is aligned to a 100 nucleotide query sequence to 
determine percent identity. The deletions occur at the 5 f end of the subject sequence and therefore, 
the FASTDB alignment does not show a matched/alignment of the first 10 nucleotides at 5' end. 
The 10 unpaired nucleotides represent 10% of the sequence (number of nucleotides at the 5 1 and 3* 

30 ends not matched/total number of nucleotides in the query sequence) so 10% is subtracted from the 
percent identity score calculated by the FASTDB program. If the remaining 90 nucleotides were 
perfectly matched the final percent identity would be 90%. In another example, a 90 nucleotide 
subject sequence is compared with a 100 nucleotide query sequence. This time the deletions are 
internal deletions so that there are no nucleotides on the 5 f or 3' of the subject sequence which are 

35 not matched/aligned with the query, hi this case the percent identity calculated by FASTDB is not 
manually corrected. Once again, only nucleotides 5' and 3' of the subject sequence which are not 
matched/aligned with the query sequence are manually corrected. No other manual corrections are 
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made for the purposes of the present invention. 

Another preferred method for determining the best overall match between a query amino 
acid sequence (a sequence of the present invention) and a subject sequence, also referred to as a 
global sequence alignment, can be determined using the FASTDB computer program based on the 

5 algorithm of Brutlag et al (1990). In a sequence alignment the query and subject sequences are 
both amino acid sequences. The result of said global sequence alignment is in percent identity. 
Preferred parameters used in a FASTDB amino acid alignment are: Matrix = PAM 0, k-tuple = 2, 
Mismatch Penalty = 1, Joining Penalty = 20, Randomization Group Length = 0, Cutoff Score = 1, 
Window Size = sequence length, Gap Penalty = 5, Gap Size Penalty = 0.05, Window Size = 500 or 

10 the length of the subject amino acid sequence, whichever is shorter. If the subject sequence is 
shorter than the query sequence due to N-or C-terminal deletions, not because of internal deletions, 
the results, in percent identity, must be manually corrected. This is because the FASTDB program 
does not account for N- and C-terminal truncations of the subject sequence when calculating global 
percent identity. For subject sequences truncated at the N- and C-termini, relative to the query 

15 sequence, the percent identity is corrected by calculating the number of residues of the query 

sequence that are N- and C- terminal of the subject sequence, which are not matched/aligned with a 
corresponding subject residue, as a percent of the total bases of the query sequence. Whether a 
residue is matched/aligned is determined by results of the FASTDB sequence alignment This 
percentage is then subtracted from the percent identity, calculated by the above FASTDB program 

20 using the specified parameters, to arrive at a final percent identity score. This final percent identity 
score is what is used for the purposes of the present invention. Only residues to the N- and O 
termini of the subject sequence, which are not matched/aligned with the query sequence, are 
considered for the purposes of manually adjusting the percent identity score. That is, only query 
amino acid residues outside the farthest N- and C-terminal residues of the subject sequence. For 

25 example, a 90 amino acid residue subject sequence is aligned with a 100-residue query sequence to 
determine percent identity. The deletion occurs at the N-terminus of the subject sequence and 
therefore, the FASTDB alignment does not match/align with the first residues at the N-terminus. 
The 10 unpaired residues represent 10% of the sequence (number of residues at the N- and C- 
termini not matched/total number of residues in the query sequence) so 10% is subtracted from the 

30 percent identity score calculated by the FASTDB program. If the remaining 90 residues were 
perfectly matched the final percent identity would be 90%. hi another example, a 90-residue 
subject sequence is compared with a 100-residue query sequence. This time the deletions are 
internal so there are no residues at the N- or C-termini of the subject sequence, which are not 
matched/aligned with the query. In this case the percent identity calculated by FASTDB is not 

35 manually corrected. Once again, only residue positions outside the N- and C-terminal ends of the 
subject sequence, as displayed in the FASTDB alignment, which are not matched/aligned with the 
query sequence are manually corrected. No other manual corrections are made for the purposes of 
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the present invention. 

THE term "percentage of sequence similarity" refers to comparisons between 

POLYPEPTIDE SEQUENCES AND IS DETERMINED BY COMPARING TWO OPTIMALLY ALIGNED 
SEQUENCES OVER A COMPARISON WINDOW, WHEREIN THE PORTION OF THE POLYPEPTIDE 
5 SEQUENCE IN THE COMPARISON WINDOW MAY COMPRISE ADDITIONS OR DELETIONS (I.E., 
GAPS^ AS COMPARED TO THE REFERENCE SEQUENCE (WHICH DOES NOT COMPRISE ADDITIONS 
OR DELETIONS) FOR OPTIMAL ALIGNMENT OF THE TWO SEQUENCES. THE PERCENTAGE IS 
CALCULATED BY DETERMINING THE NUMBER OF POSITIONS AT WHICH AN IDENTICAL OR 
EQUIVALENT AMINO ACID RESIDUE OCCURS IN BOTH SEQUENCES TO YIELD THE NUMBER OF 
10 MATCHED POSITIONS. DIVIDING THE NUMBER OF MATCHED POSITIONS BY THE TOTAL 

NUMBER OF POSITIONS IN THE WINDOW OF COMPARISON AND MULTIPLYING THE RESULT BY 
100 TO YIELD THE PERCENTAGE OF SEQUENCE SIMILARITY. SIMILARITY IS EVALUATED 
USING ANY OF THE VARIETY OF SEQUENCE COMPARISON ALGORITHMS AND PROGRAMS 
KNOWN IN THE ART, INCLUDING THOSE DESCRIBED ABOVE IN THIS SECTION. EQUIVALENT 
15 AMINO ACID RESIDUES ARE DEFINED HEREIN IN THE "MUTATED POLYPEPTIDES* SECTION. 

POLYNUCLEOTIDES OF THE INVENTION 
The present invention concerns GENSET genomic and cDNA sequences. The present 
invention encompasses GENSET genes, polynucleotides comprising GENSET genomic and cDNA 
sequences, as well as fragments and variants thereof. These polynucleotides may be purified, 
20 isolated, or recombinant. 

Also encompassed by the present invention are allelic variants, orthologs, splice variants, 
and/or species homologues of the GENSET genes. Procedures known in the art can be used to 
obtain full-length genes and cDNAs, allelic variants, splice variants, full-length coding portions, 
orthologs, and/or species homologues of genes and cDNAs corresponding to a nucleotide sequence 
25 selected from the group consisting of sequences of odd SEQ ID NOs: 1-1 1 1 and sequences of clone 
inserts of the deposited clone pool, using information from the sequences disclosed herein or the 
clone pool deposited with the ATCC. For example, allelic variants, orthologs and/or species 
homologues may be isolated and identified by making suitable probes or primers from the 
sequences provided herein and screening a suitable nucleic acid source for allelic variants and/or 
30 the desired homologue using any technique known to those skilled in the art including those 
described into the section entitled "To find similar sequences". 

In a specific embodiment, the polynucleotides of the invention are at least 15, 30, 50, 100, 
125, 500, or 1000 continuous nucleotides. In another embodiment, the polynucleotides are less 
than or equal to 300kb, 200kb, lOOkb, 50kb, lOkb, 7.5kb, 5kb, 2.5kb, 2kb, 1.5kb, or lkb in length. 
35 In a further embodiment, polynucleotides of the invention comprise a portion of the coding 
sequences, as disclosed herein, but do not comprise all or a portion of any intron. In another 
embodiment, the polynucleotides comprising coding sequences do not contain coding sequences of 
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a genomic flanking gene (i.e., 5' or V to the gene of interest in the genome). In other 
embodiments, the polynucleotides of the invention do not contain the coding sequence of more 
than 1000, 500, 250, 100, 75, 50, 25, 20, 15, 10, 5, 4, 3, 2, or 1 naturally occurring genomic 
flanking gene(s). 
5 Deposited clone pool of the invention 

Expression of GENSET genes has been shown to lead to the production of at least one 
mRNA species per GENSET gene, which cDNA sequence is set forth in the appended Sequence 
Listing as odd SEQ ID NOs: 1-1 1 1 . The cDNAs corresponding to these GENSET mRNA species 
were cloned either in the vector pBluescriptn SK" (Stratagene) or in a vector called pPT. Cells 

10 containing the cloned cDNAs of the present invention are maintained in permanent deposit by the 
inventors at Genset, S.A., 24 Rue Royale, 75008 Paris, France. Table I provides Genset's internal 
designation number assigned to each SEQ ID NO., and indicates whether the sequence is a nucleic 
acid sequence (DNA) or a protein (PRT) sequence. Each cDNA can be removed from the 
Bluescript vector in which it was inserted by performing a NotI Pst I double digestion, or from the 

1 5 pPT vector by performing a Muni Hindin double digestion, to produce the appropriate fragment 
for each clone, provided the cDNA sequence does not contain any of the corresponding restriction 
sites within its sequence. Alternatively, other restriction enzymes of the multicloning site of the 
vector may be used to recover the desired insert as indicated by the manufacturer. 

Pools of cells containing GENSET genes as described in the Sequence Listing, from which 

20 the cells containing a particular polynucleotide is obtainable, were or will be also deposited with 
the American Tissue Culture Collection (ATCC), 10801 University Boulevard, Manassas, VA 
201 10-2209, United States. Each cDNA clone has been transfected into separate bacterial cells (E- 
coli) for these composite deposits. 

Bacterial cells containing a particular clone can be obtained from the composite deposit as 

25 follows: 

An oligonucleotide probe or probes should be designed to the sequence that is known for 
that particular clone. This sequence can be derived from the sequences provided herein, or 
from a combination of those sequences. The design of the oligonucleotide probe should 
preferably follow these parameters: 
30 (a) it should be designed to an area of the sequence which has the fewest ambiguous 

bases ("N's"), if any; 
(b) preferably, the probe is designed to have a Tm of approximately 80 degrees 
Celsius (assuming 2 degrees for each A or T and 4 degrees for each G or C). 
However, probes having melting temperatures between 40 degrees Celsius and 80 
35 degrees Celsius may also be used provided that specificity is not lost. 

The oligonucleotide should preferably be labeled with gamma[ 32 P] ATP (specific activity 
6000 Ci/mmole) and T4 polynucleotide kinase using commonly employed techniques for labeling 
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oligonucleotides. Other labeling techniques can also be used. Unincorporated label should 
preferably be removed by gel filtration chromatography or other established methods. The amount 
of radioactivity incorporated into the probe should be quantified by measurement in a scintillation 
counter. Preferably, specific activity of the resulting probe should be approximately 4xl0 6 
5 dpm/pmole. 

The bacterial culture containing the pool of full-length clones should preferably be thawed 
and 100 ul of the stock used to inoculate a sterile culture flask containing 25 ml of sterile L-broth 
containing ampicillin at 100 ug/ml. The culture should preferably be grown to saturation at 37 
degrees Celsius, and the saturated culture should preferably be diluted in fresh L-broth. Aliquots 

10 of these dilutions should preferably be plated to determine the dilution and volume which will 
yield approximately 5000 distinct and well-separated colonies on solid bacteriological media 
containing L-broth containing ampicillin at 100 ug/ml and agar at 1.5% in a 150 mm petri dish 
when grown overnight at 37 degrees Celsius. Other known methods of obtaining distinct, well- 
separated colonies can also be employed. 

15 Standard colony hybridization procedures should then be used to transfer the colonies to 

nitrocellulose filters and lyse, denature and bake them. 

The filter is then preferably incubated at 65 degrees Celsius for 1 hour with gentle agitation 
in 6X SSC (20X stock is 175.3 g NaCl/liter, 88.2 g Na citrate/liter, adjusted to pH 7.0 with NaOH) 
containing 0.5% SDS, 100 pg/ml of yeast RNA, and 10 mM EDTA (approximately 10 ml per 150 

20 mm filter). Preferably, the probe is then added to the hybridization mix at a concentration greater 
than or equal to lxlO 6 dpm/ml. The filter is then preferably incubated at 65 degrees Celsius with 
gentle agitation overnight. The filter is then preferably washed in 500 ml of 2X SSC/0.1% SDS at 
room temperature with gentle shaking for 15 minutes. A third wash with 0. IX SSC/0.5% SDS at 
65 degrees Celsius for 30 minutes to 1 hour is optional. The filter is then preferably dried and 

25 subjected to autoradiography for sufficient time to visualize the positives on the X-ray film. Other 
known hybridization methods can also be employed. 

The positive colonies are picked, grown in culture, and plasmid DNA isolated using 
standard procedures. The clones can then be verified by restriction analysis, hybridization 
analysis, or DNA sequencing. The plasmid DNA obtained using these procedures may then be 

30 manipulated using standard cloning techniques familiar to those skilled in the art 

Alternatively, to recover cDNA inserts from the pool of bacteria, a PCR can be performed 
on plasmid DNA isolated using standard procedures and primers designed at both ends of the 
cDNA insertion, including primers designed in the multicloning site of the vector. If a specific 
cDNA of interest is to be recovered, primers may be designed in order to be specific for the 5' end 

35 and the 3' end of this cDNA using sequence information available from the appended sequence 
listing. The PCR product which corresponds to the cDNA of interest can then be manipulated 
using standard cloning techniques familiar to those skilled in the art. 
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Therefore, an object of the invention is an isolated, purified, or recombinant polynucleotide 
comprising a nucleotide sequence selected from the group consisting of human cDNA inserts of 
the deposited clone pool. Moreover, preferred polynucleotides of the invention include purified, 
isolated, or recombinant GENSET cDNAs consisting of, consisting essentially of, or comprising a 
5 nucleotide sequence selected from the group consisting of human cDNA inserts of the deposited 
clone pool. 

cDNA sequences of the invention 

Another object of the invention is a purified, isolated, or recombinant polynucleotide 
comprising a nucleotide sequence selected from the group consisting of the polynucleotide 

10 sequences of the appended Sequence Listing, the sequences of human cDNA clone inserts of the 
deposited clone pool, complementary sequences thereto, and fragments thereof. Moreover, 
preferred polynucleotides of the invention include purified, isolated, or recombinant GENSET 
cDNAs consisting of, consisting essentially of, or comprising a sequence selected from the group 
consisting of the polynucleotide sequences of the Sequence Listing and the sequences of clone 

15 inserts of the deposited clone pool. 

Structural parameters of each of the cDNAs of the present invention are presented in the appended 
Sequence Listing. Accordingly, the coding sequence (CDS) or open reading frame (ORF) of each 
cDNA of the invention refers to the nucleotide sequence beginning with the first nucleotide of the 
start codon and ending with the last nucleotide of the stop codon. Similarly, the 5' untranslated 

20 region (or 5'UTR) of each cDNA of the invention refers to the nucleotide sequence starting at 
nucleotide 1 and ending at the nucleotide immediately 5 5 to the first nucleotide of the start codon. 
The 3' untranslated region (or 3'UTR) of each cDNA of the invention refers to the nucleotide 
sequence starting at the nucleotide immediately 3' to the last nucleotide of the stop codon and 
ending at the last nucleotide of the cDNA. 

25 Untranslated regions 

In addition, the invention concerns a purified, isolated, and recombinant nucleic acid 
comprising a nucleotide sequence selected from the group consisting of the 5'UTRs of the 
polynucleotide sequences of the appended Sequence Listing, those of human cDNA clone inserts 
of the deposited clone pool, sequences complementary thereto, and allelic variants thereof. The 

30 invention also concerns a purified, isolated, and/or recombinant nucleic acid comprising a 
nucleotide sequence selected from the group consisting of the 3'UTRs of the polynucleotide 
sequences of the appended Sequence Listing, those of human cDNA clone inserts of the deposited 
clone pool, sequences complementary thereto, and allelic variants thereof. 

These polynucleotides may be used to detect the presence of GENSET mRNA species in a 

35 biological sample using either hybridization or RT-PCR techniques well known to those skilled in 
the art. 

In addition, these polynucleotides may be used as regulatory molecules able to affect the 
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processing and maturation of any polynucleotide including them (either a GENSET polynucleotide 
or an heterologous polynucleotide), preferably the localization, stability and/or translation of said 
polynucleotide including them [for a review on UTRs see Decker and Parker, (1995) Curr. Opin. 
Cell. Biol. 7(3) :368-92, Derrigo et al. 9 (2000) Int. J. Mol. Med. 5(2) :1 11-23]. In particular, 
5 3 'UTRs may be used in order to control the stability of heterologous mRNAs in recombinant 
vectors using any methods known to those skilled in the art including Makrides (1999) Protein 
Expr Purif 1999 Nov; 17(2): 183-202), US Patents 5,925,564; 5,807,707 and 5,756,264, which 
disclosures are hereby incorporated by reference in their entireties. 
Coding sequences 

10 Another object of the invention is an isolated, purified or recombinant polynucleotide 

comprising the coding sequence of a sequence selected from the group consisting of the 
polynucleotide sequences of the appended Sequence Listing, those of human cDNA clone inserts 
of the deposited clone pool and variants thereof. 

A further object of the invention is an isolated, purified, or recombinant polynucleotide 

1 5 encoding a polypeptide of the present invention. 

It will be appreciated that should the extent of the coding sequence differ from that 
indicated in the appended sequence listing as a result of a sequencing error, reverse transcription or 
amplification error, mRNA splicing, post-translational modification of the encoded protein, 
enzymatic cleavage of the encoded protein, or other biological factors, one skilled in the art would 

20 be readily able to identify the extent of the coding sequences in the polynucleotide sequences of 
the Sequence Listing, those of the human cDNA inserts of the deposited clone pool, and allelic 
variants thereof. Accordingly, the scope of any claims herein relating to nucleic acids containing 
the coding sequence of one of the polynucleotide sequences of the Sequence Listing and those of 
the cDNA inserts of the deposited clone pool is not to be construed as excluding any readily 

25 identifiable variations from or equivalents to the coding sequences described in the appended 

sequence listing. Equivalents include any alterations in a nucleotide coding sequence that does not 
result in an amino acid change, or that results in a conservative amino acid substitution, as defined 
below, in the polypeptide encoded by the nucleotide sequence. Similarly, should the extent of the 
polypeptides differ from those indicated in the appended Sequence Listing as a result of any of the 

30 preceding factors, the scope of claims relating to polypeptides comprising the amino acid sequence 
of the polypeptide sequences of the appended Sequence Listing is not to be construed as excluding 
any readily identifiable variations from or equivalents to the sequences described in the appended 
sequence listing. 

The above disclosed polynucleotides that contain the coding sequence of the GENSET 
35 genes may be expressed in a desired host cell or a desired host organism, when this polynucleotide 
is placed under the control of suitable expression signals. The expression signals may be either the 
expression signals contained in the regulatory regions in the GENSET genes of the invention or, in 
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contrast, the signals may be exogenous regulatory nucleic sequences. Such a polynucleotide, when 
placed under the suitable expression signals, may also be inserted in a vector for its expression 
and/or amplification. 

Further included in the present invention are polynucleotides encoding the polypeptides of the 
5 present invention that are fused in frame to the coding sequences for additional heterologous amino 
acid sequences. Also included in the present invention are nucleic acids encoding polypeptides of 
the present invention together with additional, non-coding sequences, including, but not limited to, 
non-coding 5' and 3' sequences, vector sequence, sequences used for purification, probing, or 
priming. For example, heterologous sequences include transcribed, untranslated sequences that 

10 may play a role in transcription and mRNA processing, such as ribosome binding and stability of 
mRNA. The heterologous sequences may alternatively comprise additional coding sequences that 
provide additional functionalities. Thus, a nucleotide sequence encoding a polypeptide may be 
fused to a tag sequence, such as a sequence encoding a peptide that facilitates purification or 
detection of the fused polypeptide. In certain preferred embodiments of this aspect of the 

1 5 invention, the tag amino acid sequence is a hexa-histidine peptide, such as the tag provided in a 
pQE vector (QIAGEN), or in any of a number of additional, commercially available vectors. For 
instance, hexa-histidine provides for the convenient purification of the fusion protein (see, Gentz et 
ah, 1989, Proc Natl Acad Sci U S A Feb; 86(3):821-4, the disclosure of which is incorporated by 
reference in its entirety). The "HA" tag is another peptide useful for purification which 

20 corresponds to an epitope derived from the influenza hemagglutinin protein (see, Wilson, et aL, 
1984, Cell Jul; 37(3):767-78, the disclosure of which is incorporated by reference in its entirety). 
As discussed below, other such fusion proteins include a GENSET polypeptide fused to Fc at the 
N- or C- terminus. 

Regulatory sequences of the invention 

25 As mentioned, the genomic sequence of GENSET genes contain regulatory sequences in 

the non-coding 5 '-flanking region and possibly in the non-coding 3 '-flanking region that border the 
GENSET polypeptide coding regions containing the exons of these genes. 

Polynucleotides derived from GENSET polynucleotide 5' and 3' regulatory regions are 
useful in order to detect the presence of at least a copy of a genomic nucleotide sequence of the 

30 GENSET gene or a fragment thereof in a test sample. 
Preferred regulatory sequences 

Polynucleotides carrying the regulatory elements located at the 5' end and at the 3' end of 
GENSET polypeptide coding regions may be advantageously used to control, e.g., the 
transcriptional and translational activity of a heterologous polynucleotide of interest. 

35 Thus, the present invention also concerns a purified or isolated nucleic acid comprising a 

polynucleotide which is selected from the group consisting of the 5' and 3' GENSET 
polynucleotide regulatory regions, sequences complementary thereto, regulatory active fragments 
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and variants thereof. 

Another object of the invention consists of purified, isolated or recombinant nucleic acids 
comprising a polynucleotide that hybridizes, under the stringent hybridization conditions defined 
herein, with a polynucleotide of the present invention. 
5 Preferred fragments of 5' and 3* regulatory regions are any one integer between 20 and 

20,000 nucleotides in length. 

For the purpose of the invention, a nucleic acid or polynucleotide is "functional" as a 
"regulatory region" for expressing a recombinant polypeptide or a recombinant polynucleotide if 
said regulatory polynucleotide contains nucleotide sequences which contain transcriptional and 
10 translational regulatory information, and such sequences are "operably linked" to nucleotide 
sequences which encode the desired polypeptide or the desired polynucleotide. The regulatory 
polynucleotides of the invention may be prepared using methods known in the art. 

The regulatory polynucleotides according to the invention may be part of a recombinant 
expression vector that may be used to express a coding sequence in a desired host cell or host 
15 organism. 

Preferred S'-regulatory polynucleotides of the invention include 5'-UTRs of GENSET 
cDNAs, or regulatory active fragments or variants thereof. 

Preferred S'-regulatory polynucleotide of the invention include 3 5 -UTRs of GENSET 
cDNAs, or regulatory active fragments or variants thereof. 
20 A further object of the invention consists of a purified or isolated nucleic acid comprising: 

a) polynucleotide comprising a 5' regulatory nucleotide sequence selected from the 
group consisting of: 

(i) a nucleotide sequence comprising a polynucleotide of a GENSET 

polynucleotide 5* regulatory region or a complementary sequence thereto; 
25 (ii) a nucleotide sequence comprising a polynucleotide having at least 95% of 

nucleotide identity with the nucleotide sequence of a GENSET 
polynucleotide 5' regulatory region or a complementary sequence thereto; 

(iii) a nucleotide sequence comprising a polynucleotide that hybridizes under 
stringent hybridization conditions with the nucleotide sequence of a 

30 GENSET polynucleotide 5' regulatory region or a complementary 

sequence thereto; and 

(iv) a regulatory active fragment or variant of the polynucleotides in (i), (ii) 
and (iii); 

b) a nucleic acid molecule encoding a desired polypeptide or a nucleic acid molecule 
35 of interest, wherein said nucleic acid molecule is operably linked to the 

polynucleotide defined in (a); and 

c) optionally, a polynucleotide comprising a 3'- regulatory polynucleotide, preferably 
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a 3'- regulatory polynucleotide of a GENSET gene. 
In a specific embodiment, the nucleic acid defined above includes the 5'-UTR of a 
GENSET cDNA, or a regulatory active fragment or variant thereof. 

The regulatory polynucleotide of the 3' regulatory region, or its regulatory active 
5 fragments or variants, is advantageously operably linked at the 3 '-end of the nucleic acid molecule 
encoding the desired polypeptide or nucleic acid molecule of interest. 

The desired polypeptide encoded by the above-described nucleic acid may be of various 
nature or origin, encompassing proteins of prokaryotic viral or eukaryotic origin. Among the 
polypeptides expressed under the control of a GENSET polynucleotide regulatory region include 
10 bacterial, fungal or viral antigens. Also encompassed are eukaryotic proteins such as intracellular 
proteins, such as "house keeping" proteins, membrane-bound proteins, such as mitochondrial 
membrane-bound proteins and cell surface receptors, and secreted proteins such as endogenous 
mediators such as cytokines. The desired polypeptide may be a heterologous polypeptide or a 
GENSET polypeptide, especially a protein with an amino acid sequence selected from the group 
15 consisting of the polypeptide sequences of the Sequence Listing, those encoded by the cDNA 
inserts of the deposited clone pool, fragments and variants thereof. 

The desired nucleic acids encoded by the above-described polynucleotides, usually an 
RNA molecule, may be complementary to a desired coding polynucleotide, for example to a 
GENSET coding sequence, and thus useful as an antisense polynucleotide. Such a polynucleotide 
20 may be included in a recombinant expression vector in order to express the desired polypeptide or 
the desired nucleic acid in host cell or in a host organism. Suitable recombinant vectors that 
contain a polynucleotide such as described herein are disclosed elsewhere in the specification. 
Polynucleotide variants 

The invention also relates to variants of the polynucleotides described herein and 
25 fragments thereof. "Variants" of polynucleotides, as the term is used herein, are polynucleotides 
that differ from a reference polynucleotide. Generally, differences are limited so that the 
nucleotide sequences of the reference and the variant are closely similar overall and, in many 
regions, identical. The present invention encompasses both allelic variants and degenerate 
variants. 
30 Allelic variant 

A variant of a polynucleotide may be a naturally occurring variant such as a naturally 
occurring allelic variant, or it may be a variant that is not known to occur naturally. By an "allelic 
variant" is intended one of several alternate forms of a gene occupying a given locus on a 
chromosome of an organism [see Lewin, (1989), Proc. Natl. Acad. Sci. USA 86:9832-8935], the 
35 disclosure of which is incorporated by reference in its entirety. Diploid organisms may be 
homozygous or heterozygous for an allelic form. Non-naturally occurring variants of the 
polynucleotide may be made by art-known mutagenesis techniques, including those applied to 
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polynucleotides, cells or organisms. See, for example, Table HI, which provides sets of related 
cDNAs of the invention, e.g. sets of sequences representing allelic variants of a single gene. 
Degenerate variant 

In addition to the isolated polynucleotides of the present invention, and fragments thereof, 
5 the invention further includes polynucleotides which comprise a sequence substantially different 
from those described above but which, due to the degeneracy of the genetic code, still encode a 
GENSET polypeptide of the present invention. These polynucleotide variants are referred to as 
"degenerate variants" throughout the instant application. That is, all possible polynucleotide 
sequences that encode the GENSET polypeptides of the present invention are contemplated. This 

10 includes the genetic code and species-specific codon preferences known in the art. 

Nucleotide changes present in a variant polynucleotide may be silent, which means that 
they do not alter the amino acids encoded by the polynucleotide. However, nucleotide changes 
may also result in amino acid substitutions, additions, deletions, fusions and truncations in the 
polypeptide encoded by the reference sequence. The substitutions, deletions or additions may 

1 5 involve one or more nucleotides. The variants may be altered in coding or non-coding regions or 
both. Alterations in the coding regions may produce conservative or non-conservative amino acid 
substitutions, deletions or additions. In the context of the present invention, preferred 
embodiments are those in which the polynucleotide variants encode polypeptides which retain 
substantially the same biological properties or activities as the GENSET protein. More preferred 

20 polynucleotide variants are those containing conservative substitutions. 
Similar polynucleotides 

Other embodiments of the present invention provide a purified, isolated or recombinant 
polynucleotide which is at least 80%, 85%, 90%, 95%, 96%, 97%, 98% or 99% identical to a 
polynucleotide of the present invention. The above polynucleotides are included regardless of 

25 whether they encode a polypeptide having a GENSET biological activity. This is because even 
where a particular nucleic acid molecule does not encode a polypeptide having activity, one of skill 
in the art would still know how to use the nucleic acid molecule, for instance, as a hybridization 
probe or primer. Uses of the nucleic acid molecules of the present invention that do not encode a 
polypeptide having GENSET activity include, inter alia, isolating a GENSET gene or allelic 

30 variants thereof from a DNA library, and detecting GENSET mRNA expression in biological 
samples suspected of containing GENSET mRNA or DNA, e.g., by Northern Blot or PGR 
analysis. 

The present invention is further directed to polynucleotides having sequences at least 50%. 
60%, 70%, 80%, 90%, 95%, 96%, 97%, 98% or 99% identity to a polynucleotide, where said 
35 polynucleotides do, in fact, encode a polypeptide having a GENSET biological activity. Of course, 
due to the degeneracy of the genetic code, one of ordinary skill in the art will immediately 
recognize that a large number of the polynucleotides at least 50%. 60%, 70%, 80%, 90%, 95%, 
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96%, 97%, 98%, or 99% identical to a polynucleotide selected from the group consisting of 
polynucleotide sequences of the Sequence Listing and those of human cDNA clone inserts of the 
deposited clone pool will encode a polypeptide having biological activity. By a polynucleotide 
having a nucleotide sequence at least, for example, 95% "identical" to a reference nucleotide 
5 sequence of the present invention, it is intended that the nucleotide sequence of the polynucleotide 
is identical to the reference sequence except that the polynucleotide sequence may include up to 
five point mutations per each 100 nucleotides of the reference nucleotide sequence encoding the 
GENSET polypeptide. In other words, to obtain a polynucleotide having a nucleotide sequence at 
least 95% identical to a reference nucleotide sequence, up to 5% of the nucleotides in the reference 

10 sequence may be deleted, inserted, or substituted with another nucleotide. The query sequence 
may be any polynucleotide of the present invention. 
Hybridizing Polynucleotides 

In another aspect, the invention provides an isolated or purified nucleic acid molecule 
comprising a polynucleotide which hybridizes under stringent hybridization conditions to any 

1 5 polynucleotide of the present. Such hybridizing polynucleotides may be of at least any one integer 
between 10 and 10,000 nucleotides in length. 

Of course, a polynucleotide which hybridizes only to polyA+ sequences (such as any 3 1 
terminal polyA+ tract of a cDNA shown in the sequence listing), or to a 5' complementary stretch 
of T (or U) residues, would not be included in the definition of "polynucleotide," since such a 

20 polynucleotide would hybridize to any nucleic acid molecule containing a poly(A) stretch or the 
complement thereof (e.g., practically any double-stranded cDNA clone generated using oligo dT as 
a primer). 

Complementary polynucleotides 

The invention further provides isolated nucleic acid molecules having a nucleotide 

25 sequence fully complementary to any polynucleotide of the invention. 
Polynucleotide fragments 

The present invention is further directed to portions or fragments of the polynucleotides of 
the present invention. Uses for the polynucleotide fragments of the present invention include 
probes, primers, molecular weight markers and for expressing the polypeptide fragments of the 

30 present invention. Fragments include portions of polynucleotides selected from the group 
consisting of a) polynucleotide sequences of the Sequence Listing, b) genomic GENSET 
sequences, c) polynucleotides encoding a polypeptide of the present invention, d) sequences of 
human cDNA clone inserts of the deposited clone pool, and e) polynucleotides encoding the 
polypeptides encoded by the human cDNA clone inserts of the deposited clone pool. Particularly 

35 included in the present invention is a purified or isolated polynucleotide comprising at least 8 
consecutive bases of a polynucleotide of the present invention. In one aspect of this embodiment, 
the polynucleotide comprises at least 10, 12, 15, 18, 20, 25, 28, 30, 35, 40, 50, 75, 100, 150, 200, 
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300, 400, 500, 800, 1000, 1500, or 2000 consecutive nucleotides of a polynucleotide of fee present 
invention. 

In addition to the above preferred polynucleotide sizes, further preferred sub-genuses of 
polynucleotides comprise at least X nucleotides, wherein "X" is defined as any integer between 8 
5 and the integer representing the 3 ' most nucleotide position as set forth in the sequence listing or 
elsewhere herein. Further included as preferred polynucleotides of the present invention are 
polynucleotide fragments at least X nucleotides in length, as described above, that are further 
specified in terms of their 5' and 3 5 position. The 5' and 3' positions are represented by the 
position numbers set forth in the appended sequence listing wherein the 5' most nucleotide is 1 and 

10 the 3' most nucleotide is the last nucleotide for a particular SEQ ID No. For allelic, degenerate and 
other variants, position 1 is defined as the 5' most nucleotide of the ORF, i.e., the nucleotide "A" of 
the start codon with the remaining nucleotides numbered consecutively. Therefore, every 
combination of a 5* and 3' nucleotide position that a polynucleotide fragment of the present 
invention, at least 8 contiguous nucleotides in length, could occupy on a polynucleotide of the 

15 invention is included in the invention as an individual species. The polynucleotide fragments 
specified by 5' and 3' positions can be immediately envisaged and are therefore not individually 
listed solely for the purpose of not unnecessarily lengthening the specification. 

It is noted that the above species of polynucleotide fragments of the present invention may 
alternatively be described by the formula "a to b"; where "a" equals the 5 T most nucleotide position 

20 and "b" equals the 3' most nucleotide position of the polynucleotide; and further where "a" equals 
an integer between 1 and the number of nucleotides of the polynucleotide sequence of the present 
invention minus 8, and where "b" equals an integer between 9 and the number of nucleotides of the 
polynucleotide sequence of the present invention; and where "a" is an integer smaller then M b" by 
at least 8. 

25 The present invention also provides for the exclusion of any species of polynucleotide 

fragments of the present invention specified by 5 5 and 3' positions or sub-genuses of 
polynucleotides specified by size in nucleotides as described above. Any number of fragments 
specified by 5 5 and 3' positions or by size in nucleotides, as described above, may be excluded. 
Preferred excluded fragments include those having substantial homology to repeated sequences 

30 including Alu, L 1 , THE and MER repeats, SSTR sequences or satellite, micro-satellite, and telomeric 
repeats. 

Other preferred fragments of the invention are polynucleotides comprising polynucleotide 
sequences encoding domains of polypeptides. Such fragments may be used to obtain other 
polynucleotides encoding polypeptides having similar domains using hybridization or RT-PCR 
35 techniques. Alternatively, these fragments may be used to express a polypeptide domain which 
may have a specific biological property. 

Another object of the invention is an isolated, purified or recombinant polynucleotide 
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encoding a polypeptide consisting of, consisting essentially of, or comprising a contiguous span of 
at least (any integer between 5 and 1,000 consecutive amino acids in length more preferably at 
least) 5, 6, 8, 10, 12, 15, 20, 25, 30, 35, 40, 50, 60, 75, 100, 150 or 200 consecutive amino. 
The present invention further encompasses any combination of the polynucleotide fragments listed 
5 in this section. 

Oligonucleotide primers and probes 

The present invention also encompasses fragments of GENSET polynucleotides for use as 
primers and probes. Polynucleotides derived from the GENSET genomic and cDNA sequences 
are useful in order to detect the presence of at least a copy of a GENSET polynucleotide or 
10 fragment, complement, or variant thereof in a test sample. 
Structural definition 

Any polynucleotide of the invention may be used as a primer or probe. Particularly 
preferred probes and primers of the invention include isolated, purified, or recombinant 
polynucleotides comprising a contiguous span of at least 12, 15, 18, 20, 25, 30, 35, 40, 50, 60, 70, 

15 80, 90, 100, 150, 200, 500, or 1000 nucleotides of a polynucleotide of the present invention. 
For amplification purposes, pairs of primers with approximately the same Tm are 
preferable. Primers may be designed using methods known in the art. Amplification techniques 
that can be used in the context of the present invention include, but are not limited to, the ligase 
chain reaction (LCR) described in EP-A- 320 308, WO 9320227 and EP-A-439 182, the 

20 polymerase chain reaction (PCR, RT-PCR) and techniques such as the nucleic acid sequence based 
amplification (NASBA) described in Guatelli et aL, (1990) Proc. Natl. Acad. Sci. USA 35:273-286 
and in Compton (1991) Nature 350(6313):91-92, Q-beta amplification as described in European 
Patent Application No 4544610, strand displacement amplification as described in Walker, et al. 
(1996), Clin. Chem. 42:9-13 and EP A 684 315 and, target mediated amplification as described in 

25 PCT Publication WO 9322461, the disclosures of which are incorporated by reference in their 
entireties. 

The probes of the present invention are useful for a number of purposes. They can notably 
be used in Southern hybridization to genomic DNA. The probes can also be used to detect PCR 
amplification products. They may also be used to detect mismatches in the GENSET gene or 

30 mRNA using other techniques. They may also be used to in situ hybridization. 

Any of the polynucleotides, primers and probes of the present invention can be conveniently 
immobilized on a solid support. The solid support is not critical and can be selected by one skilled 
in the art. Thus, latex particles, microparticles, magnetic beads, non-magnetic beads (including 
polystyrene beads), membranes (including nitrocellulose strips), plastic tubes, walls of microtiter 

35 wells, glass or silicon chips, sheep (or other suitable animal f s) red blood cells and duracytes are all 
suitable examples. Suitable methods for immobilizing nucleic acids on solid phases include ionic, 
hydrophobic, covalent interactions and the like. A solid support, as used herein, refers to any 
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material which is insoluble, or can be made insoluble by a subsequent reaction. The solid support 
can be chosen for its intrinsic ability to attract and immobilize the capture reagent Alternatively, 
the solid phase can retain an additional receptor which has the ability to attract and immobilize the 
capture reagent. The additional receptor can include a charged substance that is oppositely charged 
5 with respect to the capture reagent itself or to a charged substance conjugated to the capture 
reagent. As yet another alternative, the receptor molecule can be any specific binding member 
which is immobilized upon (attached to) the solid support and which has the ability to immobilize 
the capture reagent through a specific binding reaction. The receptor molecule enables the indirect 
binding of the capture reagent to a solid support material before the performance of the assay or 

10 during the performance of the assay. The solid phase thus can be a plastic, derivatized plastic, 
magnetic or non-magnetic metal, glass or silicon surface of a test tube, microtiter well, sheet, bead, 
microparticle, chip, sheep (or other suitable animal's) red blood cells, duracytes® and other 
configurations known to those of ordinary skill in the art. The polynucleotides of the invention can 
be attached to or immobilized on a solid support individually or in groups of at least 2, 5, 8, 10, 12, 

15 15, 20, or 25 distinct polynucleotides of the invention to a single solid support. In addition, 

polynucleotides other than those of the invention may be attached to the same solid support as one 
or more polynucleotides of the invention. 
Oligonucleotide array 

A substrate comprising a plurality of oligonucleotide primers or probes of the invention 

20 may be used either for detecting or amplifying targeted sequences in GENSET genes, may be used 
for detecting mutations in the coding or in the non-coding sequences of GENSET genes, and may 
also be used to determine GENSET gene expression in different contexts such as in different 
tissues, at different stages of a process (embryo development, disease treatment), and in patients 
versus healthy individuals as described elsewhere in the application. 

25 As used herein, the term "array" means a one dimensional, two dimensional, or 

multidimensional arrangement of nucleic acids of sufficient length to permit specific detection of 
gene expression. For example, the array may contain a plurality of nucleic acids derived from 
genes whose expression levels are to be assessed. The array may include a GENSET genomic 
DNA, a GENSET cDNA, sequences complementary thereto or fragments thereof. Preferably, the 

30 fragments are at least 12, 15, 18, 20, 25, 30, 35, 40 or 50 nucleotides in length. More preferably, 
the fragments are at least 100 nucleotides in length. Even more preferably, the fragments are more 
than 100 nucleotides in length. In some embodiments the fragments may be more than 500 
nucleotides in length. 

Any polynucleotide provided herein may be attached in overlapping areas or at random 
35 locations on the solid support. Alternatively the polynucleotides of the invention may be attached 
in an ordered array wherein each polynucleotide is attached to a distinct region of the solid support 
which does not overlap with the attachment site of any other polynucleotide. Preferably, such an 
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ordered array of polynucleotides is designed to be "addressable'* where the distinct locations are 
recorded and can be accessed as part of an assay procedure. Addressable polynucleotide arrays 
typically comprise a plurality of different oligonucleotide probes that are coupled to a surface of a 
substrate in different known locations. The knowledge of the precise location of each 
5 polynucleotides location makes these "addressable" arrays particularly useful in hybridization 
assays. Any addressable array technology known in the art can be employed with the 
polynucleotides of the invention. One particular embodiment of these polynucleotide arrays is 
known as the Genechips™, and has been generally described in US Patent No. 5,143,854; PCT 
publications WO 90/15070 and 92/10092, which disclosures are hereby incorporated by reference 

10 in their entireties. These arrays may generally be produced using methods known in the art, e.g., 
Fodor et ai 9 (1991) Science 25 1 :767-777, which disclosure is hereby incorporated by reference in 
its entirety. The immobilization of arrays of oligonucleotides on solid supports has been rendered 
possible by the development of a technology generally identified as "Very Large Scale 
Immobilized Polymer Synthesis" (VLSIPS™) in which, typically, probes are immobilized in a 

15 high density array on a solid surface of a chip. Examples of VLSIPS™ technologies are provided 
in US Patents 5,143,854; and 5,412,087 and in PCT Publications WO 90/15070, WO 92/10092 and 
WO 95/1 1995, which disclosures are hereby incorporated by reference in their entireties. In 
designing strategies aimed at providing arrays of nucleotides immobilized on solid supports, 
further presentation strategies known in the art may be used, such as those disclosed in PCT 

20 Publications WO 94/12305, WO 94/11530, WO 97/29212 and WO 97/31256, the disclosures of 
which are incorporated herein by reference in their entireties. 

Consequently, the invention concerns an array of nucleic acid molecules comprising at 
least one polynucleotide of the invention. Preferably, the invention concerns an array of nucleic 
acids comprising at least two polynucleotides of the invention, particularly probes or primers as 

25 described herein. Preferably, the invention concerns an array of nucleic acids comprising at least 
five polynucleotides of the invention, particularly probes or primers as described herein. 
Methods of making the polynucleotides of the invention 

The present invention also comprises methods of making the polynucleotides of the 
invention. Polynucleotides of the invention may be synthesized either enzymatically using 

30 techniques well known to those skilled in the art including amplification or hybridization-based 
methods as described herein, or chemically. 

A variety of chemical methods of synthesizing nucleic acids are known to those skilled in the art. 
la many of these methods, synthesis is conducted on a solid support. Alternatively, 
polynucleotides may be prepared as described in U.S. Patent No. 5,049,656, which disclosure is 
35 hereby incorporated by reference in its entirety. In some embodiments, several polynucleotides 
prepared as described above are ligated together to generate longer polynucleotides having a 
desired sequence. 
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Polypeptides of the invention 
The term "GENSET polypeptides" is used herein to embrace all of the proteins and polypeptides of 
the present invention. The present invention encompasses GENSET polypeptides, including 
recombinant, isolated or purified GENSET polypeptides consisting of: (a) the foil length 
5 polypeptides of even SEQ ID NOs:2-l 12; (b) the full length polypeptides encoded by the clone 
inserts of the deposited clone pool; (c) the epitope-bearing fragments of the polypeptides of even 
SEQ ID NOs:2-l 12; (d) the epitope-bearing fragments of the polypeptides encoded by the clone 
inserts contained in the deposited clone pool; (e) the domains of the polypeptides of even SEQ ID 
NOs:2-l 12; (f) the domains of the polypeptides encoded by the clone inserts contained in the 

10 deposited clone pool; (g) the signal peptides of the polypeptides of even SEQ ID NOs:2-l 12 or 
encoded by the human cDNAs of the deposited clone pool; (h) the mature polypeptides of even 
SEQ ID Nos:2-l 12 or encoded by the human cDNAs of the deposited clone pool; and (i) the allelic 
variant polypeptides of any of the polypeptides of (a)-(f). Other objects of the invention are 
polypeptides encoded by the polynucleotides of the invention as well as fusion polypeptides 

1 5 comprising such polypeptides. 
Polypeptide variants 

The present invention further provides for GENSET polypeptides encoded by allelic and 
splice variants, orthologs, and/or species homologues. Procedures known in the art can be used to 
obtain, allelic variants, splice variants, orthologs, and/or species homologues of polynucleotides 

20 encoding polypeptides of the Sequence Listing and polypeptides encoded by the clone inserts of 
the deposited clone pool, using information from the sequences disclosed herein or the clones 
deposited with the ATCC. 

The polypeptides of the present invention also include polypeptides having an amino acid 
sequence at least 50% identical, more preferably at least 60% identical, and still more preferably 

25 70%, 80%, 90%, 95%, 96%, 97%, 98% or 99% identical to a polypeptide of the present invention. 
By a polypeptide having an amino acid sequence at least, for example, 95% "identical" to a query 
amino acid sequence of the present invention, it is intended that the amino acid sequence of the 
subject polypeptide is identical to the query sequence except that the subject polypeptide sequence 
may include up to five amino acid alterations per each 100 amino acids of the query amino acid 

30 sequence. In other words, to obtain a polypeptide having an amino acid sequence at least 95% 
identical to a query amino acid sequence, up to 5% (5 of 100) of the amino acid residues in the 
subject sequence may be inserted, deleted, (indels) or substituted with another amino acid. 

Further polypeptides of the present invention include polypeptides which have at least 90% 
similarity, more preferably at least 95% similarity, and still more preferably at least 96%, 97%, 

35 98% or 99% similarity to those described above. By a polypeptide having an amino acid sequence 
at least, for example, 95% "similar" to a query amino acid sequence of the present invention, it is 
intended that the amino acid sequence of the subject polypeptide is similar (i.e. contains identical 
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or equivalent amino acid residues) to the query sequence except that the subject polypeptide 
sequence may include up to five amino acid alterations per each 100 amino acids of the query 
amino acid sequence. In other words, to obtain a polypeptide having an amino acid sequence at 
least 95% similar to a query amino acid sequence, up to 5% (5 of 100) of the amino acid residues 
5 in the subject sequence may be inserted, deleted, (indels) or substituted with another non- 
equivalent amino acid. 

These alterations of the reference sequence may occur at the amino or carboxy terminal 
positions of the reference amino acid sequence or anywhere between those terminal positions, 
interspersed either individually among residues in the reference sequence or in one or more 
10 contiguous groups within the reference sequence. The query sequence may be an entire amino 
acid sequence selected from the group consisting of polypeptide sequences of the Sequence Listing 
and those encoded by the clone inserts of the deposited clone pool or any fragment specified as 
described herein. 

The variant polypeptides described herein are included in the present invention regardless of 
15 whether they have their normal biological activity. This is because even where a particular 

polypeptide molecule does not have biological activity, one of skill in the art would still know how 
to use the polypeptide, for instance, as a vaccine or to generate antibodies. Other uses of the 
polypeptides of the present invention that do not have GENSET biological activity include, inter 
alia, as epitope tags, in epitope mapping, and as molecular weight markers on SDS-PAGE gels or 
20 on molecular sieve gel filtration columns using methods known to those of skiU in the art. As 
described below, the polypeptides of the present invention can also be used to raise polyclonal and 
monoclonal antibodies, which are useful in assays for detecting GENSET protein expression or as 
agonists and antagonists capable of enhancing or inhibiting GENSET protein function. Further, 
such polypeptides can be used in the yeast two-hybrid system to "capture" GENSET protein 
25 binding proteins, which are also candidate agonists and antagonists according to the present 
invention (see, e.g., Fields and Song, (1989), Nature, 340: 245-246, which disclosure is hereby 
incorporated by reference in its entirety). 
Preparation of the polypeptides of the invention 

The polypeptides of the present invention can be prepared in any suitable manner known in the art. 

30 Such polypeptides include isolated naturally occurring polypeptides, recombinantly produced 
polypeptides, synthetically produced polypeptides, or polypeptides produced by a combination of 
these methods. The polypeptides of the present invention are preferably provided in an isolated 
form, and may be partially or preferably substantially purified. Consequently, the present 
invention also comprises methods of making the polypeptides of the invention. 

35 Isolation 

From natural sources 

The GENSET proteins of the invention may be isolated from natural sources, including 
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bodily fluids, tissues and cells, whether directly isolated or cultured cells, of humans or non-human 
animals. Methods for extracting and purifying natural proteins are known in the art, and include 
the use of detergents or chaotropic agents to disrupt particles followed by differential extraction 
and separation of the polypeptides by ion exchange chromatography, affinity chromatography, 
5 sedimentation according to density, and gel electrophoresis. See, for example, "Methods in 
Enzymology, Academic Press, 1993" for a variety of methods for purifying proteins, which 
disclosure is hereby incorporated by reference in its entirety. Polypeptides of the invention also 
can be purified from natural sources using antibodies directed against the polypeptides of the 
invention, such as those described herein, in methods which are well known in the art of protein 

10 purification. 

From recombinant sources 

Preferably, the GENSET polypeptides of the invention are recombinantly produced using 
routine expression methods known in the art. The polynucleotide encoding the desired polypeptide 
is operably linked to a promoter into an expression vector suitable for any convenient host. Both 

1 5 eukaryotic and prokaryotic host systems are used in forming recombinant polypeptides. The 
polypeptide is then isolated from lysed cells or from the culture medium and purified to the extent 
needed for its intended use. 

Any polynucleotide of the present invention may be used to express GENSET 
polypeptides. The nucleic acid encoding the GENSET polypeptide to be expressed is operably 

20 linked to a promoter in an expression vector using conventional cloning technology. The GENSET 
insert in the expression vector may comprise the full coding sequence for the GENSET protein or a 
portion thereof. 

Consequently, a further embodiment of the present invention is a method of making a 
polypeptide of the present invention, said method comprising the steps of; 
25 a) obtaining a cDNA comprising a sequence selected from the group consisting of: 

i) the polynucleotide sequences of the Sequence Listing, 

ii) the sequences of human cDNA clone inserts of the deposited clone pool, 

iii) polynucleotide sequences encoding one of the polypeptides of the 
Sequence Listing, and 

30 iv) sequences of polynucleotides encoding a polypeptide which is encoded by 

one of the clone insert of the deposited clone pool; 

b) inserting said cDNA in an expression vector such that the cDNA is operably 
linked to a promoter, and 

c) introducing said expression vector into a host cell whereby said host cell produces 
35 said polypeptide. 

In one aspect of this embodiment, the method further comprises the step of isolating the 
polypeptide. Another embodiment of the present invention is a polypeptide obtainable by the 
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method described in the preceding paragraph. 

The expression vector is any of the mammalian, yeast, insect or bacterial expression systems 
known in the art. Commercially available vectors and expression systems are available from a 
variety of suppliers including Genetics Institute (Cambridge, MA), Stratagene (La Jolla, 
5 California), Promega (Madison, Wisconsin), and Invitrogen (San Diego, Cahfornia). If desired, to 
enhance expression and facilitate proper protein folding, the codon context and codon pairing of 
the sequence is optimized for the particular expression organism in which the expression vector is 
introduced, as explained in U.S. Patent No. 5,082,767, which disclosure is hereby incorporated by 
reference in its entirety. 

10 In one embodiment, the entire coding sequence of a GENSET cDNA and the 3'UTR 

through the poly A signal of the cDNA is operably linked to a promoter in the expression vector. 
Alternatively, if the nucleic acid encoding a portion of the GENSET protein lacks a methionine to 
serve as the initiation site, an initiating methionine can be introduced next to the first codon of the 
nucleic acid using conventional techniques. Similarly, if the insert from the GENSET cDNA lacks 

15 a poly A signal, this sequence can be added to the construct by, for example, splicing out the Poly 
A signal from pSG5 (Stratagene) using Bgll and Sail restriction endonuclease enzymes and 
incorporating it into the mammalian expression vector pXTl (Stratagene). pXTl contains the 
LTRs and a portion of the gag gene from Moloney Murine Leukemia Virus. The position of the 
LTRs in the construct allows efficient stable transfection. The vector includes the Herpes Simplex 

20 Thymidine Kinase promoter and the selectable neomycin gene. 

In another embodiment, it is often advantageous to add to the recombinant polynucleotide 
additional nucleotide sequence which codes for secretory or leader sequences, pro-sequences, 
sequences which aid in purification, such as multiple histidine residues, or an additional sequence 
for stability during recombinant production. 

25 Transfection of a GENSET expression vector into mouse NTH 3T3 cells is but one 

embodiment of introducing polynucleotides into host cells. Introduction of a polynucleotide 
encoding a polypeptide into a host cell can be effected by calcium phosphate transfection, 
DEAE-dextran mediated transfection, cationic lipid-mediated transfection, electroporation, 
transduction, infection, or other methods. Such methods are described in many standard laboratory 

30 manuals, such as Davis et al., (1986) Basic Methods in Molecular Biology, ed., Elsevier Press, 
NY, which disclosure is hereby incorporated by reference in its entirety. It is specifically 
contemplated that the polypeptides of the present invention may in fact be expressed by a host cell 
lacking a recombinant vector or naturally produced by a cell. 

Alternatively, the GENSET polypeptide to be expressed may also be a product of 

35 transgenic animals, i.e., as a component of the milk of transgenic cows, goats, pigs or sheep which 
are characterized by somatic or germ cells containing a nucleotide sequence encoding the protein 
of interest. 



39 



WO 02/094864 



PCT/IB01/01715 



A polypeptide of this invention can be recovered and purified from recombinant cell 
cultures by well-known methods including differential extraction, ammonium sulfate or ethanol 
precipitation, acid extraction, anion or cation exchange chromatography, phosphocellulose 
chromatography, hydrophobic interaction chromatography, affinity chromatography, 
5 hydroxylapatite chromatography and lectin chromatography. See, for example, "Methods in 
Enzymology", supra for a variety of methods for purifying proteins. Most preferably, high 
performance liquid chromatography ("HPLC") is employed for purification. A recombinantly 
produced version of a GENSET polypeptide can be substantially purified using techniques 
described herein or otherwise known in the art, such as, for example, by the one-step method 
10 described in Smith and Johnson (1988) Gene. 67(1):3 1-40, which disclosure is hereby incorporated 
by reference in its entirety. Polypeptides of the invention also can be purified from recombinant 
sources using antibodies directed against the polypeptides of the invention, such as those described 
herein, in methods which are well known in the art of protein purification. 

Preferably, the recombinantly expressed GENSET polypeptide is purified using standard 
15 immunochromatography techniques such as the one described in the section entitled 

'Tmmunoaffinity Chromatography". In such procedures, a solution containing the protein of 
interest, such as the culture medium or a cell extract, is applied to a column having antibodies 
against the protein attached to the chromatography matrix. The recombinant protein is allowed to 
bind the immunochromatography column. Thereafter, the column is washed to remove non- 
20 specifically bound proteins. The specifically bound secreted protein is then released from the 
column and recovered using standard techniques. 

Depending upon the host employed in a recombinant production procedure, the 
polypeptides of the present invention may be glycosylated or may be non-glycosylated. In 
addition, polypeptides- of the invention may also include an initial modified methionine residue, in 
25 some cases as a result of host-mediated processes. Thus, it is well known in the art that the 

N-terminal methionine encoded by the translation initiation codon generally is removed with high 
efficiency from any protein after translation in all eukaryotic cells. While the N-terminal 
methionine on most proteins also is efficiently removed in most prokaryotes, for some proteins, 
this prokaryotic removal process is inefficient, depending on the nature of the amino acid to which 
30 the N-terminal methionine is covalently linked. Thus, specifically included as an aspect of the 
invention are polypeptides of the present invention lacking the amino terminal methionine. 
From chemical synthesis 

In addition, polypeptides of the invention, especially short protein fragments, can be 
chemically synthesized using techniques known in the art [See, e.g., Creighton (1983), Proteins: 
35 Structures and Molecular Principles, W.H. Freeman & Co. 2nd Ed., T. E., New York; and 

Hunkapiller et al> (1984) Nature. 3 10(5973):105-1 1], which disclosures are hereby incorporated 
by reference in their entireties. For example, a polypeptide corresponding to a fragment of a 
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polypeptide sequence of the invention can be synthesized by use of a peptide synthesizer. 
Alternatively, the methods described in U.S. Patent No. 5,049,656, which disclosure is hereby 
incorporated by reference in its entirety, may be used. 

Furthermore, if desired, nonclassical amino acids or chemical amino acid analogs can be 
5 introduced as a substitution or addition into the polypeptide sequence. Non-classical amino acids 
include, but are not limited to, to the D-isomers of the common amino acids, 2,4-diaminobutyric 
acid, a-amino isobutyric acid, 4-aminobutyric acid, Abu, 2-amino butyric acid, g-Abu, e-Ahx, 6- 
amino hexanoic acid, Aib, 2-amino isobutyric acid, 3-amino propionic acid, ornithine, norleucine, 
norvaline, hydroxyproline, sarcosine, citrulline, homocitrulline, cysteic acid, t-butylglycine, t- 
10 butylalanine, phenylglycine, cyclohexylalanine, b-alanine, fluoroamino acids, designer amino acids 
such as b-methyl amino acids, Ca-methyl amino acids, Na-methyl amino acids, and amino acid 
analogs in general. Furthermore, the amino acid can be D (dextrorotary) or L (levorotary). 
Modifications 

The invention encompasses polypeptides which are differentially modified during or after 

15 translation, e.g., by glycosylation, acetylation, phosphorylation, amidation, derivatization by 
known protecting/blocking groups, proteolytic cleavage, linkage to an antibody molecule or other 
cellular ligand, etc. Any of numerous chemical modifications may be carried out by known 
techniques, including, but not limited to, specific chemical cleavage by cyanogen bromide, trypsin, 
chymotrypsin, papain, V8 protease, NaBH4; acetylation, formylation, oxidation, reduction; 

20 metabolic synthesis in the presence of tunicamycin; etc. 

Additional post-translational modifications encompassed by the invention include, for 
example, e.g., N-linked or O-linked carbohydrate chains, processing of N-terminal or C-terminal 
ends), attachment of chemical moieties to the amino acid backbone, chemical modifications of 
N-linked or O-linked carbohydrate chains, and addition or deletion of an N-terminal methionine 

25 residue as a result of prokaryotic host cell expression. The polypeptides may also be modified with 
a detectable label, such as an en2ymatic, fluorescent, isotopic or affinity label to allow for 
detection and isolation of the protein. 

Also provided by the invention are chemically modified derivatives of the polypeptides of 
the invention which may provide additional advantages such as increased solubility, stability and 

30 circulating time of the polypeptide, or decreased immunogenicity. See U.S. Patent No: 4,179,337. 
The chemical moieties for derivatization may be selected. See, U.S. Patent No: 4,179,337 which 
disclosure is hereby incorporated by reference in its entirety. The chemical moieties for 
derivatization may be selected from water soluble polymers such as polyethylene glycol, ethylene 
glycol/propylene glycol copolymers, carboxymethylcellulose, dextran, polyvinyl alcohol and the 

35 like. The polypeptides may be modified at random positions within the molecule, or at 

predetermined positions within the molecule and may include one, two, three or more attached 
chemical moieties. 
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The polymer may be of any molecular weight, and may be branched or unbranched. For 
polyethylene glycol, the preferred molecular weight is between about 1 kDa and about 100 kDa 
(the term "about" indicating that in preparations of polyethylene glycol, some molecules will 
weigh more, some less, than the stated molecular weight) for ease in handling and manufacturing. 
5 Other sizes may be used, depending on the desired therapeutic profile (e.g., the duration of 

sustained release desired, the effects, if any on biological activity, the ease in handling, the degree 
or lack of antigenicity and other known effects of the polyethylene glycol to a therapeutic protein 
or analog). 

The polyethylene glycol molecules (or other chemical moieties) should be attached to the 

10 protein with consideration of effects on functional or antigenic domains of the protein. There are a 
number of attachment methods available to those skilled in the art, e.g., EP 0 401 384, (coupling 
PEG to G-CSF), and Malik et ai, (1992), Exp. Hematol. 20:1028-1035 (reporting pegylation of 
GM-CSF using tresyl chloride), which disclosures are hereby incorporated by reference in their 
entireties. For example, polyethylene glycol may be covalently bound through amino acid residues 

15 via a reactive group, such as, a free amino or carboxyl group. Reactive groups are those to which 
an activated polyethylene glycol molecule may be bound. The amino acid residues having a free 
amino group may include lysine residues and the N-terminal amino acid residues; those having a 
free carboxyl group may include aspartic acid residues glutamic acid residues and the C-terminal 
amino acid residue. Sulfhydryl groups may also be used as a reactive group for attaching the 

20 polyethylene glycol molecules. Preferred for therapeutic purposes is attachment at an amino 
group, such as attachment at the N-terminus or lysine group. 

One may specifically desire proteins chemically modified at the N-terminus. Using 
polyethylene glycol as an illustration of the present composition, one may select from a variety of 
polyethylene glycol molecules (by molecular weight, branching, etc.), the proportion of 

25 polyethylene glycol molecules to protein (polypeptide) molecules in the reaction mix, the type of 
pegylation reaction to be performed, and the method of obtaining the selected N-terminally 
pegylated protein. The method of obtaining the N-terminally pegylated preparation (i.e., 
separating this moiety from other monopegylated moieties if necessary) may be by purification of 
the N-terminally pegylated material from a population of pegylated protein molecules. Selective 

30 proteins chemically modified at the N-terminus modification may be accomplished by reductive 
alkylation, which exploits differential reactivity of different types of primary amino groups (lysine 
versus the N-terminal) available for derivatization in a particular protein. Under the appropriate 
reaction conditions, substantially selective derivatization of the protein at the N-terminus with a 
carbonyl group containing polymer is achieved. 

35 Multimerization 

The polypeptides of the invention may be in monomers or multimers (i.e., dimers, trimers, 
tetramers and higher multimers). Accordingly, the present invention relates to monomers and 
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multimers of the polypeptides of the invention, their preparation, and compositions containing 
them. In specific embodiments, the polypeptides of the invention are monomers, dimers, trimers 
or tetramers. In additional embodiments, the multimers of the invention are at least dimers, at least 
trimers, or at least tetramers. 

5 Multimers encompassed by the invention may be homomers or heteromers. As used 

herein, the term "homomer", refers to a multimer containing only polypeptides corresponding to 
the amino acid sequences of the Sequence Listing or encoded by the human cDNA clone inserts of 
the deposited clone pool (including fragments, variants, splice variants, and fusion proteins, 
corresponding to these polypeptides as described herein). These homomers may contain 

10 polypeptides having identical or different amino acid sequences. In a specific embodiment, a 
homomer of the invention is a multimer containing only polypeptides having an identical amino 
acid sequence. In another specific embodiment, a homomer of the invention is a multimer 
containing polypeptides having different amino acid sequences. In specific embodiments, the 
multimer of the invention is a homodimer (e.g., containing polypeptides having identical or 

15 different amino acid sequences) or a homotrimer {e.g., containing polypeptides having identical 
and/or different amino acid sequences). In additional embodiments, the homomenc multimer of 
the invention is at least a homodimer, at least a homotrimer, or at least a homotetramer. 

As used herein, the term "heteromer" refers to a multimer containing one or more 
heterologous polypeptides (le., polypeptides of different proteins) in addition to the polypeptides 

20 of the invention. In a specific embodiment, the multimer of the invention is a heterodimer, a 
heterotrimer, or a heterotetramer. In additional embodiments, the heteromeric multimer of the 
invention is at least a heterodimer, at least a heterotrimer, or at least a heterotetramer. 

Multimers of the invention may be the result of hydrophobic, hydrophilic, ionic and/or 
covalent associations and/or may be indirectly linked, by for example, liposome formation. Thus, 

25 in one embodiment, multimers of the invention, such as, for example, homodimers or homotrimers, 
are formed when polypeptides of the invention contact one another in solution. In another 
embodiment, heteromultimers of the invention, such as, for example, heterotrimers or 
heterotetramers, are formed when polypeptides of the invention contact antibodies to the 
polypeptides of the invention (including antibodies to the heterologous polypeptide sequence in a 

30 fusion protein of the invention) in solution. In other embodiments, multimers of the invention are 
formed by covalent associations with and/or between the polypeptides of the invention. Such 
covalent associations may involve one or more amino acid residues contained in the polypeptide 
sequence (e.g., that recited in the sequence listing, or contained in the polypeptide encoded by a 
deposited clone). In one instance, the covalent associations are cross-linking between cysteine 

35 residues located within the polypeptide sequences, which interact in the native (i.e., naturally 
occurring) polypeptide. In another instance, the covalent associations are the consequence of 
chemical or recombinant manipulation. Alternatively, such covalent associations may involve one 
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or more amino acid residues contained in the heterologous polypeptide sequence in a fusion 
protein of the invention. 

In one example, covalent associations are between the heterologous sequence contained in 
a fusion protein of the invention (see, e.g., US Patent Number 5,478,925, which disclosure is 
5 hereby incorporated by reference in its entirety). In a specific example, the covalent associations 
are between the heterologous sequence contained in an Fc fusion protein of the invention (as 
described herein). In another specific example, covalent associations of fusion proteins of the 
invention are between heterologous polypeptide sequence from another protein that is capable of 
forming covalently associated multimers, such as for example, oseteoprotegerin (see, e.g., 

10 International Publication No: WO 98/49305, the contents of which are herein incorporated by 
reference in its entirety). In another embodiment, two or more polypeptides of the invention are 
joined through peptide linkers. Examples include those peptide linkers described in U.S. Pat. No. 
5,073,627 (hereby incorporated by reference). Proteins comprising multiple polypeptides of the 
invention separated by peptide linkers may be produced using conventional recombinant DNA 

15 technology. 

Another method for preparing multimer polypeptides of the invention involves the use of 
polypeptides of the invention fused to a leucine zipper or isoleucine zipper polypeptide sequence. 
Leucine zipper and isoleucine zipper domains are polypeptides that promote multimerization of the 
proteins in which they are found. Leucine zippers were originally identified in several 

20 DNA-binding proteins, and have since been found in a variety of different proteins [Landschulz ei 
al., (1988), Science. 240:1759]. Among the known leucine zippers are naturally occurring 
peptides and derivatives thereof that dimerize or trimerize. Examples of leucine zipper domains 
suitable for producing soluble multimeric proteins of the invention are those described in PCT 
application WO 94/10308, hereby incorporated by reference. Recombinant fusion proteins 

25 comprising a polypeptide of the invention fused to a polypeptide sequence that dimerizes or 
trimerizes in solution are expressed in suitable host cells, and the resulting soluble multimeric 
fusion protein is recovered from the culture supernatant using techniques known in the art. 

Trimeric polypeptides of the invention may offer the advantage of enhanced biological 
activity. Preferred leucine zipper moieties and isoleucine moieties are those that preferentially 

30 form trimers. One example is a leucine zipper derived from lung surfactant protein D (SPD), as 
described in Hoppe et al, (1994), FEBS Letters. 344:191 and in U.S. patent application Ser. No. 
08/446,922, which disclosure is hereby incorporated by reference in its entirety. Other peptides 
derived from naturally occurring trimeric proteins may be employed in preparing trimeric 
polypeptides of the invention. In another example, proteins of the invention are associated by 

35 interactions between Flag® polypeptide sequence contained in fusion proteins of the invention 
containing Flag® polypeptide sequence. In a further embodiment, associations proteins of the 
invention are associated by interactions between heterologous polypeptide sequence contained in 
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Flag® fusion proteins of the invention and anti Flag® antibody. 

The multimers of the invention may be generated using chemical techniques known in the 
art. For example, polypeptides desired to be contained in the multimers of the invention may be 
chemically cross-linked using linker molecules and linker molecule length optimization techniques 
5 known in the art (see, e.g., US Patent Number 5,478,925, which is herein incorporated by reference 
in its entirety). Additionally, multimers of the invention may be generated using techniques known 
in the art to form one or more inter-molecule cross-links between the cysteine residues located 
within the sequence of the polypeptides desired to be contained in the multimer (see, e.g., US 
Patent Number 5,478,925, which is herein incorporated by reference in its entirety). Further, 

10 polypeptides of the invention may be routinely modified by the addition of cysteine or biotin to the 
C terminus or N-terminus of the polypeptide and techniques known in the art may be applied to 
generate multimers containing one or more of these modified polypeptides (see, e.g., US Patent 
Number 5,478,925, which is herein incorporated by reference in its entirety). Additionally, other 
techniques known in the art may be applied to generate liposomes containing the polypeptide 

15 components desired to be contained in the multimer of the invention (see, e.g., US Patent Number 
5,478,925, which is herein incorporated by reference in its entirety). 

Alternatively, multimers of the invention may be generated using genetic engineering techniques 
known in the art. In one embodiment, polypeptides contained in multimers of the invention are 
produced recombinantly using fusion protein technology described herein or otherwise known in 

20 the art (see, e.g., US Patent Number 5,478,925, which is herein incorporated by reference in its 
entirety). In a specific embodiment, polynucleotides coding for a homodimer of the invention are 
generated by ligating a polynucleotide sequence encoding a polypeptide of the invention to a 
sequence encoding a linker polypeptide and then further to a synthetic polynucleotide encoding the 
translated product of the polypeptide in the reverse orientation from the original C-terminus to the 

25 N-terminus (lacking the leader sequence) (see, e.g., US Patent Number 5,478,925, which is herein 
incorporated by reference in its entirety). In another embodiment, recombinant techniques 
described herein or otherwise known in the art are applied to generate recombinant polypeptides of 
the invention which contain a transmembrane domain (or hydrophobic or signal peptide) and 
which can be incorporated by membrane reconstitution techniques into liposomes (see, e.g., US 

30 Patent Number 5,478,925, which is herein incorporated by reference in its entirety). 
Mutated polypeptides 

To improve or alter the characteristics of GENSET polypeptides of the present invention, 
protein engineering may be employed. Recombinant DNA technology known to those skilled in 
the art can be used to create novel mutant proteins or muteins including single or multiple amino 

35 acid substitutions, deletions, additions, or fusion proteins. Such modified polypeptides can show, 
e.g., increased/decreased biological activity or increased/decreased stability. In addition, they may 
be purified in higher yields and show better solubility than the corresponding natural polypeptide, 
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at least under certain purification and storage conditions. Further, the polypeptides of the present 
invention may be produced as multimers including dimers, trimers and tetramers. Multimerization 
may be facilitated by linkers or recombinantly though heterologous polypeptides such as Fc 
regions. 
5 N- and C-terminal deletions 

It is known in the art that one or more amino acids may be deleted from the N-terminus or 
C-terminus without substantial loss of biological function. [See, e.g., Ron et al y (1993), Biol 
Chem., 268 2984-2988.] Accordingly, the present invention provides polypeptides having one or 
more residues deleted from the amino terminus. Similarly, many examples of biologically 
10 functional C-terminal deletion mutants are known (see, e.g. t Dobeli, et al. 1988). Accordingly, the 
present invention provides polypeptides having one or more residues deleted from the carboxy 
terminus. The invention also provides polypeptides having one or more amino acids deleted from 
both the amino and the carboxyl termini as described below. 
Other mutations 

1 5 Other mutants in addition to N- and C-terminal deletion forms of the protein discussed 

above are included in the present invention. Thus, the invention further includes variations of the 
GENSET polypeptides which show substantial GENSET polypeptide activity. Such mutants 
include deletions, insertions, inversions, repeats, and substitutions selected according to general 
rules known in the art so as to have little effect on activity. 

20 There are two main approaches for studying the tolerance of an amino acid sequence to 

change [see, Bowie et aL, (1994), Science, 247:1306-1310, which disclosure is hereby 
incorporated by reference in its entirety]. The first method relies on the process of evolution, in 
which mutations are either accepted or rejected by natural selection. The second approach uses 
genetic engineering to introduce amino acid changes at specific positions of a cloned gene and 

25 selections or screens to identify sequences that maintain functionality. These studies have revealed 
that proteins are surprisingly tolerant of amino acid substitutions. 

Typically seen as conservative substitutions are the replacements, one for another, among 
the aliphatic amino acids Ala, Val, Leu and Phe; interchange of the hydroxyl residues Ser and Thr, 
exchange of the acidic residues Asp and Glu, substitution between the amide residues Asn and Gin, 

30 exchange of the basic residues Lys and Arg and replacements among the aromatic residues Phe, 
Tyr. Thus, the polypeptide of the present invention may be, for example: (i) one in which one or 
more of the amino acid residues are substituted with a conserved or non-conserved amino acid 
residue (preferably a conserved amino acid residue) and such substituted amino acid residue may 
or may not be one encoded by the genetic code; or (ii) one in which one or more of the amino acid 

35 residues includes a substituent group; or (iii) one in which the GENSET polypeptide is fused with 
another compound, such as a compound to increase the half-life of the polypeptide (for example, 
polyethylene glycol); or (iv) one in which the additional amino acids are fused to the above form of 
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the polypeptide, such as an IgG Fc fusion region peptide or leader or secretory sequence or a 
sequence which is employed for purification of the above form of the polypeptide or a pro-protein 
sequence. 

Thus, the GENSET polypeptides of the present invention may include one or more amino 
5 acid substitutions, deletions, or additions, either from natural mutations or human manipulation. 
As indicated, changes are preferably of a minor nature, such as conservative amino acid 
substitutions that do not significantly affect the folding or activity of the protein. The following 
groups of amino acids represent equivalent changes: (1) Ala, Pro, Gly, Glu, Asp, Gin, Asn, Ser, 
Thr, (2) Cys, Ser, Tyr, Thr; (3) Val, He, Leu, Met, Ala, Phe; (4) Lys, Arg, His; (5) Phe, Tyr, Trp, 
10 His. 

Furthermore, GENSET polypeptides of the present invention may include one or more 
amino acid substitutions that mimic modified amino acids. An example of this type of substitution 
includes replacing amino acids that are capable of being phosphorylated (e.g., serine, threonine, or 
tyrosine) with a negatively charged amino acid that resembles the negative charge of the 

15 phosphorylated amino acid (e.g., aspartic acid or glutamic acid). Also included is substitution of 
amino acids that are capable of being modified by hydrophobic groups (e.g., arginine) with amino 
acids carrying bulky hydrophobic side chains, such as tryptophan or phenylalanine. Therefore, a 
specific embodiment of the invention includes GENSET polypeptides that include one or more 
amino acid substitutions that mimic modified amino acids at positions where amino acids that are 

20 capable of being modified are normally positioned. Further included are GENSET polypeptides 
where any subset of modifiable amino acids are substituted. For example, a GENSET polypeptide 
that includes three serine residues may be substituted at any one, any two, or all three of said 
serines. Furthermore, any GENSET polypeptide amino acid capable of being modified may be 
excluded from substitution with a modification-mimicking amino acid. 

25 A specific embodiment of a modified GENSET peptide molecule of interest according to 

the present invention, includes, but is not limited to, a peptide molecule which is resistant to 
proteolysis, is a peptide in which the -CONH- peptide bond is modified and replaced by a 
(CH2NH) reduced bond, a (NHCO) retro inverso bond, a (CH2-0) methylene-oxy bond, a (CH2- 
S) thiomethylene bond, a (CH2CH2) carba bond, a (CO-CH2) cetomethylene bond, a (CHOH- 

30 CH2) hydroxyethylene bond), a (N-N) bound, a E-alcene bond or also a -CH=CH- bond. The 
invention also encompasses a human GENSET polypeptide or a fragment or a variant thereof in 
which at least one peptide bond has been modified as described above. * 

Amino acids in the GENSET proteins of the present invention that are essential for 
function can be identified by methods known in die art, such as site-directed mutagenesis or 

35 alanine-scanning mutagenesis [see, e.g., Cunningham et ah (1989), Science 244:1081-1085, which 
disclosure is hereby incorporated by reference in its entirety]. Of special interest are substitutions 
of charged amino acids with other charged or neutral amino acids which may produce proteins 
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with highly desirable improved characteristics, such as less aggregation. Aggregation may not 
only reduce activity but also be problematic when preparing pharmaceutical formulations, because 
aggregates can be immunogenic. [See, e.g., Pinckard et al. y (1967), Clin. Exp. Immunol 2:331- 
340; Robbins et al. 9 (1987), Diabetes. 36:838-845; and Cleland et al., (1993), Crit. Rev. 
5 Therapeutic Drug Carrier Systems. 10:307-377.] 

A further embodiment of the invention relates to a polypeptide which comprises the amino 
acid sequence of a GENSET polypeptide having an amino acid sequence which contains at least 
any one integer from 1 to 50 of conservative amino acid substitutions. Further included are 
polypeptides that contain not more than 40 conservative amino acid substitutions, not more than 30 

10 conservative amino acid substitutions, and not more than 20 conservative amino acid substitutions. 
Also provided are polypeptides which comprise the etmino acid sequence of a GENSET 
polypeptide, having at least one, but not more than 10, 9, 8, 7, 6, 5, 4, 3, 2 or 1 conservative amino 
acid substitutions. Further provided are conservative amino acid substitutions at any appropriate 
position or combination of appropriate positions whereby all possible species are included as 

15 embodiments of the present invention. Each conservative substitution or combination of 
substitutions may also be excluded. 
Polypeptide fragments 
Structural definition 

The present invention is further directed to fragments of the polypeptides of the present 

20 invention. More specifically, the present invention embodies purified, isolated, and recombinant 
polypeptides comprising at least any one integer between 6 and 1000 (or the length of the 
polypeptides amino acid residues minus 1 if the length is less than 1000) of consecutive amino acid 
residues. Preferably, the fragments are at least 6, preferably at least 8 to 10, more preferably 12, 
15, 20, 25, 30, 35, 40, 50, 60, 75, 100, 125, 150, 175, 200, 225, 250, 275, or 300 consecutive amino 

25 acids of a polypeptide of the present invention. 

Li addition to the above polypeptide fragments, further preferred sub-genuses of 
polypeptides comprise at least X amino acids, wherein "X" is defined as any integer between 6 and 
the integer representing the C-terminal amino acid of the polypeptide of the present invention 
including the polypeptide sequences of the sequence listing below. Further included are species of 

30 polypeptide fragments at least 6 amino acids in length, as described above, that are further 

specified in terms of their N-terminal and C-terminal positions. However, included in the present 
invention as individual species are all polypeptide fragments, at least 6 amino acids in length, as 
described above, and may be particularly specified by a N-terminal and C-terminal position. That 
is, every combination of a N-terminal and C-terminal position that a fragment at least 6 contiguous 

35 amino acid residues in length could occupy, on any given amino acid sequence of the sequence 
listing or of the present invention is included in the present invention 

Further preferred polypeptide fragments comprising amino acids of the sequences of the 
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EVEN numbered SEQ ID NOs. of the Sequence listing, and polynucleotides encoding the same, 
are selected from the group consisting of amino acids consecutively numbered from 1-6, 1-7, 1-8, 
1-9, 1-10, 1-11, 1-12, 1-13, 1-14, 1-15, 1-16, 1-17, 1-18, 1-19, 1-20, 1-21, 1-22, 1-23, 1-24, 1-25, 
1-26, 1-27, 1-28, 1-29, 1-30, 1-31, 1-32, 1-33, 1-34, 1-35, 1-36, 1-37, 1-38, 1-39, 1-40, 1-41, 1-42, 
5 1-43, 1-44, 1-45, 1-46, 1-47, 1-48, 1-49, 1-50, 1-51, 1-52, 1-53, 1-54, 1-55, 1-56, 1-57, 1-58, 1-59, 
1-60, 1-61, 1-62, 1-63, 1-64, 1-65, 1-66, 1-67, 1-68, 1-69, 1-70, 1-71, 1-72, 1-73, 1-74, 1-75, 1-76, 
1-77, 1-78, 1-79, 1-80, 1-81, 1-82, 1-83, 1-84, 1-85, 1-86, 1-87, 1-88, 1-89, 1-90, 1-91, 1-92, 1-93, 
1-94, 1-95, 1-96, 1-97, 1-98, 1-99, 1-100, 1-101, 1-102, 1-103, 1-104, 1-105, 1-106, 1-107, 1-108, 
1-109, 1-110, 1-111, 1-112, 1-113, 1-114, 1-115, 1-116, 1-117, 1-118, 1-119, 1-120, 1-121, 1-122, 

10 1-123, 1-124, 1-125, 1-126, 1-127, 1-128, 1-129, 1-130, 1-131, 1-132, 1-133, 1-134, 1-135, 1-136, 
1-137, 1-138, 1-139, 1-140, 1-141, 1-142, 1-143, 1-144, 1-145, 1-146, 1-147, 1-148, 1-149, 1-150, 
1-151, 1-152, 1-153, 1-154, 1-155, 1-156, 1-157, 1-158, 1-159, 1-160, 1-161, 1-162, 1-163, 1-164, 
1-165, 1-166, 1-167, 1-168, 1-169, 1-170, 1-171, 1-172, 1-173, 1-174, 1-175, 1-176, 1-177, 1-178, 
1-179, 1-180, 1-181, 1-182, 1-183, 1-184, 1-185, 1-186, 1-187, 1-188, 1-189, 1-190, 1-191, 1-192, 

15 1-193, 1-194, 1-195, 1-196, 1-197, 1-198, 1-199, 1-200, 1-201, 1-202, 1-203, 1-204, 1-205, 1-206, 
1-207, 1-208, 1-209, 1-210, 1-211, 1-212, 1-213, 1-214, 1-215, 1-216, 1-217, 1-218, 1-219, 1-220, 
1-221, 1-222, 1-223, 1-224, 1-225, 1-226, 1-227, 1-228, 1-229, 1-230, 1-231, 1-232, 1-233, 1-234, 
1-235, 1-236, 1-237, 1-238, 1-239, 1-240, 1-241, 1-242, 1-243, 1-244, 1-245, 1-246, 1-247, 1-248, 
1-249, 1-250, 1-251, 1-252, 1-253, 1-254, 1-255, 1-256, 1-257, 1-258, 1-259, 1-260, 1-261, 1-262, 

20 1-263, 1-264, 1-265, 1-266, 1-267, 1-268, 1-269, 1-270, 1-271, 1-272, 1-273, 1-274, 1-275, 1-276, 
1-277, 1-278, 1-279, 1-280, 1-281, 1-282, 1-283, 1-284, 1-285, 1-286, 1-287, 1-288, 1-289, 1-290, 
1-291, 1-292, 1-293, 1-294, 1-295, 1-296, 1-297, 1-298, 1-299, 1-300, 1-301, 1-302, 1-303, 1-304, 
1-305, 1-306, 1-307, 1-308, 1-309, 1-310, 1-311, 1-312, 1-313, 1-314, 1-315, 1-316, 1-317, 1-318, 
1-319, 1-320, 1-321, 1-322, 1-323, 1-324, 1-325, 1-326, 1-327, 1-328, 1-329, 1-330, 1-331, 1-332, 

25 1-333, 1-334, 1-335, 1-336, 1-337, 1-338, 1-339, 1-340, 1-341, 1-342, 1-343, 1-344, 1-345, 1-346, 
1-347, 1-348, 1-349, 1-350, 1-351, 1-352, 1-353, 1-354, 1-355, 1-356, 1-357, 1-358, 1-359, 1-360, 
1-361, 1-362, 1-363, 1-364, 1-365, 1-366, 1-367, 1-368, 1-369, 1-370, 1-371, 1-372, 1-373, 1-374, 
1-375, 1-376, 1-377, 1-378, 1-379, 1-380, 1-381, 1-382, 1-383, 1-384, 1-385, 1-386, 1-387, 1-388, 
1-389, 1-390, 1-391, 1-392, 1-393, 1-394, 1-395, 1-396, 1-397, 1-398, 1-399, 1400, 1-401, 1-402, 

30 1-403, 1-404, 1-405, 1-406, 1-407, 1-408, 1-409, 1-410, Mil, 1-412, 1-413, 1414, 1-415, 1-416, 
1-417, 1-418, 1419, 1-420, 1-421, 1-422, 1-423, 1-424, 1-425, 1-426, 1-427, 1428, 1429, 1-430, 
1-431, 1-432, 1-433, 1-434, 1-435, 1-436, 1-437, 1438, 1439, 1440, 1441, 1442, 1443, 1444, 
I-445, 1-446, 1447, 1448, 1449, 1450, 1451, 1452, 1453, 1454, 1455, 1456, 1457, 1458, 
1459, 1460, 1461, 1462, 1463, 1464, 1465, 1466, 1467, 1468, 1469, 1470, 1471, 1472, 

35 1473, 1474, 1475, 1476, 1477, 1478, 1479, 1480, 1481, 1482, 1483, 1484, 1485, 1486, 
1-487, 1488, 1489, 1490, 1491, 1492, 1493, 1494, 1495, 1496, 1497, 1498, 1499, 1-500, 
1-501, 1-502, 1-503, 1-504, 1-505, 1-506, 1-507, 1-508, 1-509, 1-510, 1-511, 1-512, 1-513, 1-514, 
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1-515, 1-516, 1-517, 1-518, 1-519, 1-520, 1-521, 1-522, 1-523, 1-524, 1-525, 1-526, 1-527, 1-528, 
1-529, 1-530, 1-531, 1-532, 1-533, 1-534, 1-535, 1-536, 1-537, 1-538, 1-539, 1-540, 1-541, 1-542, 
1-543, 1-544, 1-545, 1-546, 1-547, 1-548, 1-549, 1-550, 1-551, 1-552, 1-553, 1-554, 1-555, 1-556, 
1-557, 1-558, 1-559, 1-560, 1-561, 1-562, 1-563, 1-564, 1-565, 1-566, 1-567, 1-568, 1-569, 1-570, 
5 1-571, 1-572, 1-573, 1-574, 1-575, 1-576, 1-577, 1-578, 1-579, 1-580, 1-581, 1-582, 1-583, 1-584, 
1-585, 1-586, 1-587, 1-588, 1-589, 1-590, 1-591, 1-592, 1-593, 1-594, 1-595, 1-596, 1-597, 1-598, 
1-599, 1-600, 1-601, 1-602, 1-603, 1-604, 1-605, 1-606, 1-607, 1-608, 1-609, 1-610, 1-611, 1-612, 
1-613, 1-614, 1-615, 1-616, 1-617, 1-618, 1-619, 1-620, 1-621, 1-622, 1-623, 1-624, 1-625, 1-626, 
1-627, 1-628, 1-629, 1-630, 1-631, 1-632, 1-633, 1-634, 1-635, 1-636, 1-637, 1-638, 1-639, 1-640, . 

10 1-641, 1-642, 1-643, 1-644, 1-645, 1-646, 1-647, 1-648, 1-649, 1-650, 1-651, 1-652, 1-653, 1-654, 
1-655, 1-656, 1-657, 1-658, 1-659, 1-660, 1-661, 1-662, 1-663, 1-664, 1-665, 1-666, 1-667, 1-668, 
1-669, 1-670, 1-671, 1-672, 1-673, 1-674, 1-675, 1-676, 1-677, 1-678, 1-679, 1-680, 1-681, 1-682, 
1-683, 1-684, 1-685, 1-686, 1-687, 1-688, 1-689, 1-690, 1-691, 1-692, 1-693, 1-694, 1-695, 1-696, 
1-697, 1-698, 1-699, 1-700, 1-701, 1-702, 1-703, 1-704, 1-705, 1-706, 1-707, 1-708, 1-709, 1-710, 

15 1-711, 1-712, 1-713, 1-714, 1-715, 1-716, 1-717, 1-718, 1-719, 1-720, 1-721, 1-722, 1-723, 1-724, 
1-725, 1-726, 1-727, 1-728, 1-729, 1-730, 1-731, 1-732, 1-733, 1-734, 1-735, 1-736, 1-737, 1-738, 
1-739, 1-740, 1-741, 1-742, 1-743, 1-744, 1-745, 1-746, 1-747, 1-748, 1-749, 1-750, 1-751, 1-752, 
1-753, 1-754, 1-755, 1-756, 1-757, 1-758, 1-759, 1-760, 1-761, 1-762, 1-763, 1-764, 1-765, 1-766, 
1-767, 1-768, 1-769, 1-770, 1-771, 1-772, 1-773, 1-774, 1-775, 1-776, 1-777, 1-778, 1-779, 1-780, 

20 1-781, 1-782, 1-783, 1-784, 1-785, 1-786, 1-787, 2-787, 3-787, 4-787, 5-787, 6-787, 7-787, 8-787, 
9-787, 10-787, 11-787, 12-787, 13-787, 14-787, 15-787, 16-787, 17-787, 18-787, 19-787, 20-787, 
21-787, 22-787, 23-787, 24-787, 25-787, 26-787, 27-787, 28-787, 29-787, 30-787, 31-787, 32-787, 
33-787, 34-787, 35-787, 36-787, 37-787, 38-787, 39-787, 40-787, 41-787, 42-787, 43-787, 44-787, 
45-787, 46-787, 47-787, 48-787, 49-787, 50-787, 51-787, 52-787, 53-787, 54-787, 55-787, 56-787, 

25 57-787, 58-787, 59-787, 60-787, 61-787, 62-787, 63-787, 64-787, 65-787, 66-787, 67-787, 68-787, 
69-787, 70-787, 71-787, 72-787, 73-787, 74-787, 75-787, 76-787, 77-787, 78-787, 79-787, 80-787, 
81-787, 82-787, 83-787, 84-787, 85-787, 86-787, 87-787, 88-787, 89-787, 90-787, 91-787, 92-787, 
93-787, 94-787, 95-787, 96-787, 97-787, 98-787, 99-787, 100-787, 101-787, 102-787, 103-787, 
104-787, 105-787, 106-787, 107-787, 108-787, 109-787, 110-787, 111-787, 112-787, 113-787, 

30 114-787, 115-787, 116-787, 117-787, 118-787, 119-787, 120-787, 121-787, 122-787, 123-787, 
124-787, 125-787, 126-787, 127-787, 128-787, 129-787, 130-787, 131-787, 132-787, 133-787, 
134-787, 135-787, 136-787, 137-787, 138-787, 139-787, 140-787, 141-787, 142-787, 143-787, 
144-787, 145-787, 146-787, 147-787, 148-787, 149-787, 150-787, 151-787, 152-787, 153-787, 
154-787, 155-787, 156-787, 157-787, 158-787, 159-787, 160-787, 161-787, 162-787, 163-787, 

35 164-787, 165-787, 166-787, 167-787, 168-787, 169-787, 170-787, 171-787, 172-787, 173-787, 
174-787, 175-787, 176-787, 177-787, 178-787, 179-787, 180-787, 181-787, 182-787, 183-787, 
184-787, 185-787, 186-787, 187-787, 188-787, 189-787, 190-787, 191-787, 192-787, 193-787, 
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194-787, 195-787, 196-787, 197-787, 198-787, 199-787,200-787,201-787,202-787,203-787, 
204-787, 205-787, 206-787, 207-787, 208-787, 209-787, 210-787, 211-787, 212-787, 213-787, 
214-787, 215-787, 216-787, 217-787, 218-787, 219-787, 220-787, 221-787, 222-787, 223-787, 
224-787, 225-787, 226-787, 227-787, 228-787, 229-787, 230-787, 231-787, 232-787, 233-787, 
5 234-787, 235-787, 236-787, 237-787, 238-787, 239-787, 240-787, 241-787, 242-787, 243-787, 
244-787, 245-787, 246-787, 247-787, 248-787, 249-787, 250-787, 251-787, 252-787, 253-787, 
254-787, 255-787, 256-787, 257-787, 258-787, 259-787, 260-787, 261-787, 262-787, 263-787, 
264-787, 265-787, 266-787, 267-787, 268-787, 269-787, 270-787, 271-787, 272-787, 273-787, 
274-787, 275-787, 276-787, 277-787, 278-787, 279-787, 280-787, 281-787, 282-787, 283-787, 

10 284-787, 285-787, 286-787, 287-787, 288-787, 289-787, 290-787, 291-787, 292-787, 293-787, 
294-787, 295-787, 296-787, 297-787, 298-787, 299-787, 300-787, 301-787, 302-787, 303-787, 
304-787, 305-787, 306-787, 307-787, 308-787, 309-787, 310-787, 311-787, 312-787, 313-787, 
314-787, 315-787, 316-787, 317-787, 318-787, 319-787, 320-787, 321-787, 322-787, 323-787, 
324-787, 325-787, 326-787, 327-787, 328-787, 329-787, 330-787, 331-787, 332-787, 333-787, 

15 334-787, 335-787, 336-787, 337-787, 338-787, 339-787, 340-787, 341-787, 342-787, 343-787, 
344-787, 345-787, 346-787, 347-787, 348-787, 349-787, 350-787, 351-787, 352-787, 353-787, 
354-787, 355-787, 356-787, 357-787, 358-787, 359-787, 360-787, 361-787, 362-787, 363-787, 
364-787, 365-787, 366-787, 367-787, 368-787, 369-787, 370-787, 371-787, 372-787, 373-787, 
374-787, 375-787, 376-787, 377-787, 378-787, 379-787, 380-787, 381-787, 382-787, 383-787, 

20 384-787, 385-787, 386-787, 387-787, 388-787, 389-787, 390-787, 391-787, 392-787, 393-787, 
394-787, 395-787, 396-787, 397-787, 398-787, 399-787, 400-787, 401-787, 402-787, 403-787, 
404-787, 405-787, 406-787, 407-787, 408-787, 409-787, 410-787, 411-787, 412-787, 413-787, 
414-787, 415-787, 416-787, 417-787, 418-787, 419-787, 420-787, 421-787, 422-787, 423-787, 
424-787, 425-787, 426-787, 427-787, 428-787, 429-787, 430-787, 431-787, 432-787, 433-787, 

25 434-787, 435-787, 436-787, 437-787, 438-787, 439-787, 440-787, 441-787, 442-787, 443-787, 
444-787, 445-787, 446-787, 447-787, 448-787, 449-787, 450-787, 451-787, 452-787, 453-787, 
454-787, 455-787, 456-787, 457-787, 458-787, 459-787, 460-787, 461-787, 462-787, 463-787, 
464-787, 465-787, 466-787, 467-787, 468-787, 469-787, 470-787, 471-787, 472-787, 473-787, 
474-787, 475-787, 476-787, 477-787, 478-787, 479-787, 480-787, 481-787, 482-787, 483-787, 

30 484-787, 485-787, 486-787, 487-787, 488-787, 489-787, 490-787, 491-787, 492-787, 493-787, 
494-787, 495-787, 496-787, 497-787, 498-787, 499-787, 500-787, 501-787, 502-787, 503-787, 
504-787, 505-787, 506-787, 507-787, 508-787, 509-787, 510-787, 51 1-787, 512-787, 513-787, 
514-787, 515-787, 516-787, 517-787, 518-787, 519-787, 520-787, 521-787, 522-787, 523-787, 
524-787, 525-787, 526-787, 527-787, 528-787, 529-787, 530-787, 531-787, 532-787, 533-787, 

35 534-787, 535-787, 536-787, 537-787, 538-787, 539-787, 540-787, 541-787, 542-787, 543-787, 
544-787, 545-787, 546-787, 547-787, 548-787, 549-787, 550-787, 551-787, 552-787, 553-787, 
554-787, 555-787, 556-787, 557-787, 558-787, 559-787, 560-787, 561-787, 562-787, 563-787, 
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564-787, 565-787, 566-787, 567-787, 568-787, 569-787, 570-787, 571-787, 572-787, 573-787, 
574-787, 575-787, 576-787, 577-787, 578-787, 579-787, 580-787, 581-787, 582-787, 583-787, 
584-787, 585-787, 586-787, 587-787, 588-787, 589-787, 590-787, 591-787, 592-787, 593-787, 
594-787, 595-787, 596-787, 597-787, 598-787, 599-787, 600-787, 601-787, 602-787, 603-787, 
5 604-787, 605-787, 606-787, 607-787, 608-787, 609-787, 610-787, 61 1-787, 612-787, 613-787, 
614-787, 615-787, 616-787, 617-787, 618-787, 619-787, 620-787, 621-787, 622-787, 623-787, 
624-787, 625-787, 626-787, 627-787, 628-787, 629-787, 630-787, 631-787, 632-787, 633-787, 
634-787, 635-787, 636-787, 637-787, 638-787, 639-787, 640-787, 641-787, 642-787, 643-787, 
644-787, 645-787, 646-787, 647-787, 648-787, 649-787, 650-787, 651-787, 652-787, 653-787, 

10 654-787, 655-787, 656-787, 657-787, 658-787, 659-787, 660-787, 661-787, 662-787, 663-787, 
664-787, 665-787, 666-787, 667-787, 668-787, 669-787, 670-787, 671-787, 672-787, 673-787, 
674-787, 675-787, 676-787, 677-787, 678-787, 679-787, 680-787, 681-787, 682-787, 683-787, 
684-787, 685-787, 686-787, 687-787, 688-787, 689-787, 690-787, 691-787, 692-787, 693-787, 
694-787, 695-787, 696-787, 697-787, 698-787, 699-787, 700-787, 701-787, 702-787, 703-787, 

15 704-787, 705-787, 706-787, 707-787, 708-787, 709-787, 710-787, 71 1-787, 712-787, 713-787, 
714-787, 715-787, 716-787, 717-787, 718-787, 719-787, 720-787, 721-787, 722-787, 723-787, 
724-787, 725-787, 726-787, 727-787, 728-787, 729-787, 730-787, 731-787, 732-787, 733-787, 
734-787, 735-787, 736-787, 737-787, 738-787, 739-787, 740-787, 741-787, 742-787, 743-787, 
744-787, 745-787, 746-787, 747-787, 748-787, 749-787, 750-787, 751-787, 752-787, 753-787, 

20 754-787, 755-787, 756-787, 757-787, 758-787, 759-787, 760-787, 761-787, 762-787, 763-787, 
764-787, 765-787, 766-787, 767-787, 768-787, 769-787, 770-787, 771-787, 772-787, 773-787, 
774-787, 775-787, 776-787, 777-787, 778-787, 779-787, 780-787, 781-787, 782-787, 2-786, 3- 
785, 4-784, 5-783, 6-782, 7-781, 8-780, 9-779, 10-778, 11-777, 12-776, 13-775, 14-774, 15-773, 
16-772, 17-771, 18-770, 19-769, 20-768, 21-767, 22-766, 23-765, 24-764, 25-763, 26-762, 27-761, 
. 25 28-760, 29-759, 30-758, 31-757, 32-756, 33-755, 34-754, 35-753, 36-752, 37-751, 38-750, 39-749, 
40-748, 41-747, 42-746, 43-745, 44-744, 45-743, 46-742, 47-741, 48-740, 49-739, 50-738, 51-737, 
52-736, 53-735, 54-734, 55-733, 56-732, 57-731, 58-730, 59-729, 60-728, 61-727, 62-726, 63-725, 
64-724, 65-723, 66-722, 67-721, 68-720, 69-719, 70-718, 71-717, 72-716, 73-715, 74-714, 75-713, 
76-712, 77-711, 78-710, 79-709, 80-708, 81-707, 82-706, 83-705, 84-704, 85-703, 86-702, 87-701, 

30 88-700, 89-699, 90-698, 91-697, 92-696, 93-695, 94-694, 95-693, 96-692, 97-691, 98-690, 99-689, 
100-688, 101-687, 102-686, 103-685, 104-684, 105-683, 106-682, 107-681, 108-680, 109-679, 
110-678, 111-677, 112-676, 113-675, 114-674, 115-673, 116-672, 117-671, 118-670, 119-669, 
120-668, 121-667, 122-666, 123-665, 124-664, 125-663, 126-662, 127-661, 128-660, 129-659, 
130-658, 131-657, 132-656, 133-655, 134-654, 135-653, 136-652, 137-651, 138-650, 139^649, 

35 140-648, 141-647, 142-646, 143-645, 144-644, 145-643, 146-642, 147-641, 148-640, 149-639, 
150-638, 151-637, 152-636, 153-635, 154-634, 155-633, 156-632, 157-631, 158-630, 159-629, 
160-628, 161-627, 162-626, 163-625, 164-624, 165-623, 166-622, 167-621, 168-620, 169-619, 
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170-618, 171-617, 172-616, 173-615, 174-614, 175-613, 176-612, 177-611, 178-610, 179-609, 
180-608, 181-607, 182-606, 183-605, 184-604, 185-603, 186-602, 187-601, 188-600, 189-599, 
190-598, 191-597, 192-596, 193-595, 194-594, 195-593, 196-592, 197-591, 198-590, 199-589, 
200-588, 201-587, 202-586, 203-585, 204-584, 205-583, 206-582, 207-581, 208-580, 209-579, 
5 210-578, 211-577, 212-576, 213-575, 214-574, 215-573, 216-572, 217-571, 218-570, 219-569, 
220-568, 221-567, 222-566, 223-565, 224-564, 225-563, 226-562, 227-561, 228-560, 229-559, 
230-558, 231-557, 232-556, 233-555, 234-554, 235-553, 236-552, 237-551, 238-550, 239-549, 
240-548, 241-547, 242-546, 243-545, 244-544, 245-543, 246-542, 247-541, 248-540, 249-539, 
250-538, 251-537, 252-536, 253-535, 254-534, 255-533, 256-532, 257-531, 258-530, 259-529, 

10 260-528, 261-527, 262-526, 263-525, 264-524, 265-523, 266-522, 267-521, 268-520, 269-519, 
270-518, 271-517, 272-516, 273-515, 274-514, 275-513, 276-512, 277-511, 278-510, 279-509, 
280-508, 281-507, 282-506, 283-505, 284-504, 285-503, 286-502, 287-501, 288-500, 289-499, 
290-498, 291-497, 292-496, 293-495, 294-494, 295-493, 296-492, 297-491, 298-490, 299-489, 
300-488, 301-487, 302-486, 303-485, 304-484, 305-483, 306-482, 307-481, 308-480, 309-479, 

15 310-478, 311-477, 312-476, 313-475, 314474, 315-473, 316-472, 317-471, 318-470, 319-469, 
320-468, 321-467, 322-466, 323-465, 324-464, 325-463, 326-462, 327-461, 328-460, 329-459, 
330-458, 331-457, 332-456, 333-455, 334-454, 335-453, 336-452, 337-451, 338-450, 339-449, 
340-448, 341447, 342-446, 343-445, 344-444, 345-443, 346-442, 347-441, 348-440, 349-439, 
350-438, 351-437, 352-436, 353-435, 354-434, 355-433, 356-432, 357-431, 358430, 359-429, 

20 360-428, 361-427, 362-426, 363-425, 364-424, 365-423, 366-422, 367-421, 368-420, 369-419, 
370-418, 371-417, 372-416, 373-415, 374-414, 375-413, 376-412, 377-411, 378-410, 379-409, 
380-408, 381-407, 382-406, 383-405, 384-404, 385-403, 386-402, 387-401, 388-400, 389-399, 
390-398, and 391-397, wherein the numbering of amino acids comprising any one fragment is 
consistent with the polypeptide sequence of any one EVEN numbered SEQ ID of the Sequence 

25 listing. 

Further preferred polypeptide fragments of the EVEN numbered SEQ ID NOs. of the 
Sequence listing, and polynucleotides encoding the same, are selected from the group consisting of 
fragments comprising any 50 consecutive amino acids numbered from 1-50, 2-51, 3-52, 4-53, 5- 
54, 6-55, 7-56, 8-57, 9-58, 10-59, 11-60, 12-61, 13-62, 14-63, 15-64, 16-65, 17-66, 18-67, 19-68, 

30 20-69, 21-70, 22-71, 23-72, 24-73, 25-74, 26-75, 27-76, 28-77, 29-78, 30-79, 31-80, 32-81, 33-82, 
34-83, 35-84, 36-85, 37-86, 38-87, 39-88, 40-89, 41-90, 42-91, 43-92, 44-93, 45-94, 46-95, 47-96, 
48-97, 49-98, 50-99, 51-100, 52-101, 53-102, 54-103, 55-104, 56-105, 57-106, 58-107, 59-108, 
60-109, 61-110, 62-111, 63-112, 64-113, 65-114, 66-115, 67-116, 68-117, 69-118, 70-119, 71-120, 
72-121, 73-122, 74-123, 75-124, 76-125, 77-126, 78-127, 79-128, 80-129, 81-130, 82-131, 83-132, 

35 84-133, 85-134, 86-135, 87-136, 88-137, 89-138, 90-139, 91-140, 92-141, 93-142, 94-143, 95-144, 
96-145,97-146, 98-147, 99-148, 100-149, 101-150, 102-151, 103-152, 104-153, 105-154, 106- 
155, 107-156, 108-157, 109-158, 110-159, 111-160, 112-161, 113-162, 114-163, 115-164, 116- 
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165, 117-166, 118-167, 119-168, 120-169, 121-170, 122-171, 123-172, 124-173, 125-174, 126- 
175, 127-176, 128-177, 129-178, 130-179, 131-180, 132-181, 133-182, 134-183, 135-184, 136- 
185, 137-186, 138-187, 139-188, 140-189, 141-190, 142-191, 143-192, 144-193, 145-194, 146- 
195, 147-196, 148-197, 149-198, 150-199, 151-200, 152-201, 153-202, 154-203, 155-204, 156- 

5 205, 157-206, 158-207, 159-208, 160-209, 161-210, 162-211, 163-212, 164-213, 165-214, 166- 
215, 167-216, 168-217, 169-218, 170-219, 171-220, 172-221, 173-222, 174-223, 175-224, 176- 
225, 177-226, 178-227, 179-228, 180-229, 181-230, 182-231, 183-232, 184-233, 185-234, 186- 
235, 187-236, 188-237, 189-238, 190-239, 191-240, 192-241, 193-242, 194-243, 195-244, 196- 
245, 197-246, 198-247, 199-248, 200-249, 201-250, 202-251, 203-252, 204-253, 205-254, 206- 

10 255, 207-256, 208-257, 209-258, 210-259, 21 1-260, 212-261, 213-262, 214-263, 215-264, 216- 
265, 217-266, 218-267, 219-268, 220-269, 221-270, 222-271, 223-272, 224-273, 225-274, 226- 
275, 227-276, 228-277, 229-278, 230-279, 231-280, 232-281, 233-282, 234-283, 235-284, 236- 
285, 237-286, 238-287, 239-288, 240-289, 241-290, 242-291, 243-292, 244-293, 245-294, 246- 
295, 247-296, 248-297, 249-298, 250-299, 251-300, 252-301, 253-302, 254-303, 255-304, 256- 

15 305, 257-306, 258-307, 259-308, 260-309, 261-310, 262-31 1, 263-312, 264-313, 265-314, 266- 
315, 267-316, 268-317, 269-318, 270-319, 271-320, 272-321, 273-322, 274-323, 275-324, 276- 
325, 277-326, 278-327, 279-328, 280-329, 281-330, 282-331, 283-332, 284-333, 285-334, 286- 
335, 287-336, 288-337, 289-338, 290-339, 291-340, 292-341, 293-342, 294-343, 295-344, 296- 
345, 297-346, 298-347, 299-348, 300-349, 301-350, 302-351, 303-352, 304-353, 305-354, 306- 

20 355, 307-356, 308-357, 309-358, 310-359, 311-360, 312-361, 313-362, 314-363, 315-364, 316- 
365, 317-366, 318-367, 319-368, 320-369, 321-370, 322-371, 323-372, 324-373, 325-374, 326- 
375, 327-376, 328-377, 329-378, 330-379, 331-380, 332-381, 333-382, 334-383, 335-384, 336- 
385, 337-386, 338-387, 339-388, 340-389, 341-390, 342-391, 343-392, 344-393, 345-394, 346- 
395, 347-396, 348-397, 349-398, 350-399, 351-400, 352-401, 353-402, 354-403, 355-404, 356- 

25 405, 357-406, 358-407, 359-408, 360-409, 361-410, 362-41 1, 363-412, 364-413, 365-414, 366- 
415, 367-416, 368-417, 369-418, 370-419, 371-420, 372-421, 373-422, 374-423, 375-424, 376- 
425, 377-426, 378-427, 379-428, 380-429, 381-430, 382-431, 383-432, 384-433, 385-434, 386- 
435, 387-436, 388-437, 389-438, 390-439, 391-440, 392-441, 393-442, 394-443, 395444, 396- 
445, 397-446, 398-447, 399-448, 400-449, 401-450, 402-451, 403-452, 404-453, 405-454, 406- 

30 455, 407-456, 408-457, 409-458, 410-459, 41 1-460, 412-461, 413-462, 414-463, 415-464, 416- 
465, 417-466, 418-467, 419-468, 420-469, 421-470, 422-471, 423-472, 424^173, 425474, 426- 
475, 427-476, 428-477, 429-478, 430-479, 431-480, 432481, 433482, 434-483, 435484, 436- 
485, 437486, 438-487, 439^88, 440-489, 441490, 442491, 443-492, 444-493, 445494, 446- 
495, 447-496, 448-497, 449-498, 450^199, 451-500, 452-501, 453-502, 454-503, 455-504, 456- 

35 505, 457-506, 458-507, 459-508, 460-509, 461-510, 462-511, 463-512, 464-513, 465-514, 466- 
515, 467-516, 468-517, 469-518, 470-519, 471-520, 472-521, 473-522, 474-523, 475-524, 476- 
525, 477-526, 478-527, 479-528, 480-529, 481-530, 482-531, 483-532, 484-533, 485-534, 486- 
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535, 487-536, 488-537, 489-538, 490-539, 491-540, 492-541, 493-542, 494-543, 495-544, 496- 
545, 497-546, 498-547, 499-548, 500-549, 501-550, 502-551, 503-552, 504-553, 505-554, 506- 
555, 507-556, 508-557, 509-558, 510-559, 511-560, 512-561, 513-562, 514-563, 515-564, 516- 
565, 517-566, 518-567, 519-568, 520-569, 521-570, 522-571, 523-572, 524-573, 525-574, 526- 
5 575, 527-576, 528-577, 529-578, 530-579, 531-580, 532-581, 533-582, 534-583, 535-584, 536- 
585, 537-586, 538-587, 539-588, 540-589, 541-590, 542-591, 543-592, 544-593, 545-594, 546- 
595, 547-596, 548-597, 549-598, 550-599, 551-600, 552-601, 553-602, 554-603, 555-604, 556- 
605, 557-606, 558-607, 559-608, 560-609, 561-610, 562-611, 563-612, 564-613, 565-614, 566- 
615, 567-616, 568-617, 569-618, 570-619, 571-620, 572-621, 573-622, 574-623, 575-624, 576- 

10 625, 577-626, 578-627, 579-628, 580-629, 581-630, 582-631, 583-632, 584-633, 585-634, 586- 
635, 587-636, 588-637, 589-638, 590-639, 591-640, 592-641, 593-642, 594-643, 595-644, 596- 
645, 597-646, 598-647, 599-648, 600-649, 601-650, 602-651, 603-652, 604-653, 605-654, 606- 
655, 607-656, 608-657, 609-658, 610-659, 611-660, 612-661, 613-662, 614-663, 615-664, 616- 
665, 617-666, 618-667, 619-668, 620-669, 621-670, 622-671, 623-672, 624-673, 625-674, 626- 

15 675, 627-676, 628-677, 629-678, 630-679, 631-680, 632-681, 633-682, 634-683, 635-684, 636- 
685, 637-686, 638-687, 639-688, 640-689, 641-690, 642-691, 643-692, 644-693, 645-694, 646- 
695, 647-696, 648-697, 649-698, 650-699, 651-700, 652-701, 653-702, 654-703, 655-704, 656- 
705, 657-706, 658-707, 659-708, 660-709, 661-710, 662-711, 663-712, 664-713, 665-714, 666- 
715, 667-716, 668-717, 669-718, 670-719, 671-720, 672-721, 673-722, 674-723, 675-724, 676- 

20 725, 677-726, 678-727, 679-728, 680-729, 681-730, 682-731, 683-732, 684-733, 685-734, 686- 
735, 687-736, 688-737, 689-738, 690-739, 691-740, 692-741, 693-742, 694-743, 695-744, 696- 
745, 697-746, 698-747, 699-748, 700-749, 701-750, 702-751, 703-752, 704-753, 705-754, 706- 
755, 707-756, 708-757, 709-758, 710-759, 711-760, 712-761, 713-762, 714-763, 715-764, 716- 
765, 717-766, 718-767, 719-768, 720-769, 721-770, 722-771, 723-772, 724-773, 725-774, 726- 

25 775, 727-776, 728-777, 729-778, 730-779, 731-780, 732-781, 733-782, 734-783, 735-784, 736- 
785, 737-786, and 738-787, wherein the numbering of amino acids comprising any one fragment is 
consistent with the polypeptide sequence of any one EVEN numbered SEQ ID of the Sequence 
listing. 

Further preferred polypeptide fragments of the EVEN numbered SEQ ID NOs. of the 
30 Sequence listing, and polynucleotides encoding the same, are selected from the group consisting of 
fragments comprising any 100 consecutive amino acids numbered from 1-100, 2-101, 3-102, 4- 
103, 5-104,6-105,7-106, 8-107,9-108, 10-109, 11-110,12-111,13-112, 14-113, 15-114, 16-115, 
17-116, 18-117, 19-118, 20-119, 21-120, 22-121, 23-122, 24-123, 25-124, 26-125, 27-126, 28-127, 
29-128, 30-129, 31-130, 32-131, 33-132, 34-133, 35-134, 36-135, 37-136, 38-137, 39-138, 40-139, 
35 41-140, 42-141, 43-142, 44-143, 45-144, 46-145, 47-146, 48-147, 49-148, 50-149, 51-150, 52-151, 
53-152, 54-153, 55-154, 56-155, 57-156, 58-157, 59-158, 60-159, 61-160, 62-161, 63-162, 64-163, 
65-164, 66-165, 67-166, 68-167, 69-168, 70-169, 71-170, 72-171, 73-172, 74-173, 75-174, 76-175, 
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77-176, 78-177, 79-178, 80-179, 81-180, 82-181, 83-182, 84-183, 85-184, 86-185, 87-186, 88-187, 
89-188, 90-189, 91-190, 92-191, 93-192, 94-193, 95-194, 96-195, 97-196, 98-197, 99-198, 100- 
199, 101-200, 102-201, 103-202, 104-203, 105-204, 106-205, 107-206, 108-207, 109-208, 110- 
209, 111-210, 112-211, 113-212, 114-213, 115-214, 116-215, 117-216, 118-217, 119-218, 120- 

5 219, 121-220, 122-221, 123-222, 124-223, 125-224, 126-225, 127-226, 128-227, 129-228, 130- 
229, 131-230, 132-231, 133-232, 134-233, 135-234, 136-235, 137-236, 138-237, 139-238, 140- 
239, 141-240, 142-241, 143-242, 144-243, 145-244, 146-245, 147-246, 148-247, 149-248, 150- 
249, 151-250, 152-251, 153-252, 154-253, 155-254, 156-255, 157-256, 158-257, 159-258, 160- 
259, 161-260, 162-261, 163-262, 164-263, 165-264, 166-265, 167-266, 168-267, 169-268, 170- 

10 269, 171-270, 172-271, 173-272, 174-273, 175-274, 176-275, 177-276, 178-277, 179-278, 180- 
279, 181-280, 182-281, 183-282, 184-283, 185-284, 186-285, 187-286, 188-287, 189-288, 190- 
289, 191-290, 192-291, 193-292, 194-293, 195-294, 196-295, 197-296, 198-297, 199-298,200- 
299, 201-300, 202-301, 203-302, 204-303, 205-304, 206-305, 207-306, 208-307, 209-308, 210- 
309, 211-310, 212-311, 213-312, 214-313, 215-314, 216-315, 217-316, 218-317, 219-318, 220- 

15 319, 221-320, 222-321, 223-322, 224-323, 225-324, 226-325, 227-326, 228-327, 229-328, 230- 
329, 231-330, 232-331, 233-332, 234-333, 235-334, 236-335, 237-336, 238-337, 239-338, 240- 
339, 241-340, 242-341, 243-342, 244-343, 245-344, 246-345, 247-346, 248-347, 249-348, 250- 
349, 251-350, 252-351, 253-352, 254-353, 255-354, 256-355, 257-356, 258-357, 259-358, 260- 
359, 261-360, 262-361, 263-362, 264-363, 265-364, 266-365, 267-366, 268-367, 269-368, 270- 

20 369, 271-370, 272-371, 273-372, 274-373, 275-374, 276-375, 277-376, 278-377, 279-378, 280- 
379, 281-380, 282-381, 283-382, 284-383, 285-384, 286-385, 287-386, 288-387, 289-388, 290- 
389, 291-390, 292-391, 293-392, 294-393, 295-394, 296-395, 297-396, 298-397, 299-398, 300- 
399, 301-400, 302-401, 303-402, 304-403, 305-404, 306-405, 307-406, 308-407, 309-408, 310- 
409, 311-410, 312-411, 313-412, 314-413, 315-414, 316-415, 317-416, 318-417, 319-418, 320- 

25 419, 321-420, 322-421, 323-422, 324-423, 325-424, 326-425, 327-426, 328-427, 329^28, 330- 
429, 331-430, 332-431, 333-432, 334-433, 335-434, 336-435, 337-436, 338-437, 339-438, 340- 
439, 341-440, 342-441, 343^42, 344-443, 345-444, 346-445, 347-446, 348-447, 349-448, 350- 
449, 351-450, 352-451, 353-452, 354-453, 355-454, 356-455, 357-456, 358-457, 359-458, 360- 
459, 361-460, 362-461, 363-462, 364-463, 365-464, 366-465, 367-466, 368-467, 369-468, 370- 

30 469, 371-470, 372-471, 373-472, 374-473, 375-474, 376-475, 377-476, 378-477, 379-478, 380- 
479, 381-480, 382-481, 383-482, 384-483, 385-484, 386-485, 387-486, 388-487, 389-488, 390- 
489, 391-490, 392-491, 393-492, 394-493, 395^194, 396-495, 397-496, 398-497, 399-498, 400- 
499, 401-500, 402-501, 403-502, 404-503, 405-504, 406-505, 407-506, 408-507, 409-508, 410- 
509, 411-510, 412-511, 413-512, 414-513, 415-514, 416-515, 417-516, 418-517, 419-518, 420- 

35 519, 421-520, 422-521, 423-522, 424-523, 425-524, 426-525, 427-526, 428-527, 429-528, 430- 
529, 431-530, 432-531, 433-532, 434-533, 435-534, 436-535, 437-536, 438-537, 439-538, 440- 
539, 441-540, 442-541, 443-542, 444-543, 445-544, 446-545, 447-546, 448-547, 449-548, 450- 
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549, 451-550, 452-551, 453-552, 454-553, 455-554, 456-555, 457-556, 458-557, 459-558, 460- 
559, 461-560, 462-561, 463-562, 464-563, 465-564, 466-565, 467-566, 468-567, 469-568, 470- 
569, 471-570, 472-571, 473-572, 474-573, 475-574, 476-575, 477-576, 478-577, 479-578, 480- 
579, 481-580, 482-581, 483-582, 484-583, 485-584, 486-585, 487-586, 488-587, 489-588, 490- 
5 589, 491-590, 492-591, 493-592, 494-593, 495-594, 496-595, 497-596, 498-597, 499-598, 500- 
599, 501-600, 502-601, 503-602, 504-603, 505-604, 506-605, 507-606, 508-607, 509-608, 510- 
609, 511-610, 512-611, 513-612, 514-613, 515-614, 516-615, 517-616, 518-617, 519-618, 520- 
619, 521-620, 522-621, 523-622, 524-623, 525-624, 526-625, 527-626, 528-627, 529-628, 530- 
629, 531-630, 532-631, 533-632, 534-633, 535-634, 536-635, 537-636, 538-637, 539-638, 540- 

10 639, 541-640, 542-641, 543-642, 544-643, 545-644, 546-645, 547-646, 548-647, 549-648, 550- 
649, 551-650, 552-651, 553-652, 554-653, 555-654, 556-655, 557-656, 558-657, 559-658, 560- 
659, 561-660, 562-661, 563-662, 564-663, 565-664, 566-665, 567-666, 568-667, 569-668, 570- 
669, 571-670, 572-671, 573-672, 574-673, 575-674, 576-675, 577-676, 578-677, 579-678, 580- 
679, 581-680, 582-681, 583-682, 584-683, 585-684, 586-685, 587-686, 588-687, 589-688, 590- 

15 689, 591-690, 592-691, 593-692, 594-693, 595-694, 596-695, 597-696, 598-697, 599-698, 600- 
699, 601-700, 602-701, 603-702, 604-703, 605-704, 606-705, 607-706, 608-707, 609-708, 610- 
709, 611-710, 612-711, 613-712, 614-713, 615-714, 616-715, 617-716, 618-717, 619-718, 620- 
719, 621-720, 622-721, 623-722, 624-723, 625-724, 626-725, 627-726, 628-727, 629-728, 630- 
729, 631-730, 632-731, 633-732, 634-733, 635-734, 636-735, 637-736, 638-737, 639-738, 640- 

20 739, 641-740, 642-741, 643-742, 644-743, 645-744, 646-745, 647-746, 648-747, 649-748, 650- 
749, 651-750, 652-751, 653-752, 654-753, 655-754, 656-755, 657-756, 658-757, 659-758, 660- 
759, 661-760, 662-761, 663-762, 664-763, 665-764, 666-765, 667-766, 668-767, 669-768, 670- 
769, 671-770, 672-771, 673-772, 674-773, 675-774, 676-775, 677-776, 678-777, 679-778, 680- 
779, 681-780, 682-781, 683-782, 684-783, 685-784, 686-785, 687-786, and 688-787, wherein the 

25 numbering of amino acids comprising any one fragment is consistent with the polypeptide 
sequence of any one EVEN numbered SEQ ID of the Sequence listing. 

These specific embodiments, and other polypeptide and polynucleotide fragment 
embodiments described herein may be modified as being "at least", "equal to", "equal to or less 
than", "less than", "at least but not greater than " or "from to ". a specified size or 

30 specified N-terminal and/or C-terminal positions. It is noted that all ranges used to describe any 
embodiment of the present invention are inclusive unless specifically set forth otherwise. 

The present invention also provides for the exclusion of any individual fragment specified 
by N-terminal and C-terminal positions or of any fragment specified by size in amino acid residues 
as described above. In addition, any number of fragments specified by N-terminal and C-terminal 

35 positions or by size in amino acid residues as described above may be excluded as individual 
species. Further, any number of fragments specified by N-terminal and C-terminal positions or by 
size in amino acid residues as described above may mate up a polypeptide fragment in any 



57 



WO 02/094864 



PCT/IB01/01715 



combination and may optionally include non-GENSET and GENSET-Related polypeptide 
sequences as well. 

The above polypeptide fragments of the present invention can be immediately envisaged 
using the above description and are therefore not individually listed solely for the purpose of not 
5 unnecessarily lengthening the specification. Moreover, the above fragments need not have a 
GENSET biological activity, although polypeptides having these activities are preferred 
embodiments of the invention, since they would be useful, for example, in immunoassays, in 
epitope mapping, epitope tagging, as vaccines, and as molecular weight markers. The above 
fragments may also be used to generate antibodies to a particular portion of the polypeptide. These 

10 antibodies can then be used in immunoassays well known in the art to distinguish between human 
and non-human cells and tissues or to determine whether cells or tissues in a biological sample are 
or are not of the same type which express the polypeptides of the present invention. 

It is noted that the above species of polypeptide fragments of the present invention may 
alternatively be described by the formula "a to b"; where "a" equals the N-terminal most amino 

15 acid position and "b" equals the C-terminal most amino acid position of the polynucleotide; and 
further where "a" equals an integer between 1 and the number of amino acids of the polypeptide 
sequence of the present invention minus 6, and where "b" equals an integer between 7 and the 
number of amino acids of the polypeptide sequence of the present invention; and where "a" is an 
integer smaller then "b ,? by at least 6. 

20 The present invention also provides for the exclusion of any species of polypeptide 

fragments of the present invention specified by 5' and 3' positions or sub-genuses of polypeptides 
specified by size in amino acids as described above. Any number of fragments specified by 5' and 
3* positions or by size in amino acids, as described above, may be excluded. 
Functional definition 

25 Domains 

Preferred polynucleotide fragments of the invention comprise domains of polypeptides of 
the invention. Such domains may eventually comprise linear or structural motifs and signatures 
including, but not limited to, leucine zippers, helix-turn-helix motifs, post-translational 
modification sites such as glycosylation sites, ubiquitination sites, alpha helices, and beta sheets, 

30 signal sequences encoding signal peptides which direct the secretion of the encoded proteins, 
sequences implicated in transcription regulation such as homeoboxes, acidic stretches, enzymatic 
active sites, substrate binding sites, and enzymatic cleavage sites. Such domains may present a 
particular biological activity such as DNA or RNA-binding, secretion of proteins, transcription 
regulation, enzymatic activity, substrate binding activity, etc. 

35 In a preferred embodiment, domains comprise a number of amino acids that is any integer 

between 6 and 1000. Domains may be synthesized using any methods known to those skilled in 
the art, including those disclosed herein. Methods for determining the amino acids which make up 



58 



WO 02/094864 



PCT/IB01/01715 



a domain with a particular biological activity include mutagenesis studies and assays to determine 
the biological activity to be tested. 

Alternatively, the polypeptides of the invention may be scanned for motifs, domains and/or 
signatures in databases using any computer method known to those skilled in the art Searchable 

5 databases include Prosite [Hofmann et al, (1999) Nucl. Acids Res. 27:215-219; Bucher and 
Bairoch (1994) Proceedings 2nd International Conference on Intelligent Systems for Molecular 
Biology. Altaian et al, Eds., pp53-61, AAAIPress, Menlo Park], Pfam [Sonnhammer, et aL 9 (1997) 
Proteins. 28(3):405-20; Henikoffe* al, (2000) Electrophoresis 21(9):1700-6; Bateman et al, 
(2000) Nucleic Acids Res. 28(l):263-6], Blocks [Henikoff et al, (2000) Nucleic Acids Res. 

10 28(l):228-30], Print [Attwood et a/., (1996) Nucleic Acids Res. 24(1): 182-8], Prodom 

[Sonnhammer and Kahn, (1994) Protein Sci. 3(3):482-92; Corpet et al (2000) Nucleic Acids Res. 
28(l):267-9], Sbase [Pongor et al (1993) Protein Eng. 6(4):391-5; Murvai et al, (2000) Nucleic 
Acids Res. 28(l):260-2], Smart [Schultz et al (1998) Proc Natl Acad Sci USA 95, 5857-5864], 
Dali/FSSP [Holm and Sander (1996) Nucleic Acids Res. 24(l):206-9, Holm and Sander (1997) 

15 Nucleic Acids Res. 25(l):231-4 and Holm and Sander (1999) Nucleic Acids Res. 27(l):244-7], 
HSSP [Sander and Schneider (1991) Proteins. 9(l):56-68.], CATH [Orengo et al, (1997) 
Structure. 5(8):1093-108; Pearl et al, (2000) Biochem Soc Trans. 28(2):269-75], SCOP [Murzin et 
al, (1995) J Mol Biol. 247(4):536-40; Lo Conte et al, (2000) Nucleic Acids Res. 28(l):257-9], 
COG [Tatusov et al (1997), Science, 278, 631 :637 and Tatusov et al (2000), Nucleic Acids Res. 

20 28(l):33-6], specific family databases and derivatives thereof [Nevill-Manning et al, (1998) Proc. 
Natl. Acad. Sci. US A. 95, 5865-5871; Yona,e* al, (1999), Proteins. 37(3):360-78; Attwoodef 
al, (2000) Nucleic Acids Res. 28(l):225-7], each of which disclosures are hereby incorporated by 
reference in their entireties. For a review on available databases, see issue 1 of volume 28 of 
Nucleic Acid Research (2000), which disclosure is hereby incorporated by reference in its entirety. 

25 Epitopes and Antibody Fusions: 

A preferred embodiment of the present invention is directed to epitope-bearing 
polypeptides and epitope-bearing polypeptide fragments. These epitopes may be "antigenic 
epitopes" or both an "antigenic epitope" and an "immunogenic epitope". An "immunogenic 
epitope" is defined as a part of a protein that elicits an antibody response in vivo when the 

30 polypeptide is the immunogen. On the other hand, a region of polypeptide to which an antibody 
binds is defined as an "antigenic determinant" or "antigenic epitope." The number of 
immunogenic epitopes of a protein generally is less than the number of antigenic epitopes [see, 
e.g., Geysen et al, (1984), Proc. Natl. Acad. Sci. U.S.A. 81:3998-4002, which disclosure is hereby 
incorporated by reference in its entirety]. It is particularly noted that although a particular epitope 

35 may not be immunogenic, it is nonetheless useful since antibodies can be made to both 

immunogenic and antigenic epitopes. When the antigen is a polypeptide, it is customary to classify 
epitopes as being linear (i.e., composed of a contiguous sequence of amino acids repeated along 
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the polypeptide chain) or nonlinear (i.e., composed of amino acids brought into proximity as a 
result of the folding of the polypeptide chain). Nonlinear epitopes are also called "conformational" 
because they arise through the folding of the polypeptide chain into a particular conformation, i.e., 
a distinctive 3-D shape. 

5 An epitope can comprise as few as 3 amino acids in a spatial conformation, which is 

unique to the epitope. Generally an epitope consists of at least 6 such amino acids, and more often 
at least 8-10 such amino acids. In preferred embodiment, antigenic epitopes comprise a number of 
amino acids that is any integer between 3 and 50. Fragments which function as epitopes may be 
produced by any conventional means [see, e.g., Houghten (1985), Proc. Natl. Acad. Sci. USA 

10 82:5131-5135], also further described in U.S. Patent No. 4,631,21, which disclosures are hereby 
incorporated by reference in their entireties. Methods for determining the amino acids which make 
up an epitope include x-ray crystallography, 2-dimensional nuclear magnetic resonance, and 
epitope mapping, e.g., the Pepscan method described by Geysen, et ah (1984); PCT Publication 
No. WO 84/03564; and PCT Publication No. WO 84/03506, which disclosures are hereby 

15 incorporated by reference in their entireties. Nonlinear epitopes are determined by methods such 
as protein footprinting (U.S. Patent 5,691,448, which disclosure is hereby incorporated by 
reference in its entirety). Another example is the algorithm of Jameson and Wolf, (1988), Comp. 
Appl. Biosci. 4:181-186 (said reference incorporated by reference in its entirety). The Jameson- 
Wolf antigenic analysis, for example, may be performed using the computer program PROTEAN, 

20 using default parameters (Version 4.0 Windows, DNASTAR, Inc., 1228 South Park Street 
Madison, WI. 

All fragments of the polypeptides of the present invention, at least 6 amino acids residues 
in length, are included in the present invention as being useful as antigenic linear epitopes. Amino 
acid residues comprising other immunogenic epitopes may be determined by Jameson-Wolf 

25 analysis, by other similar algorithms, or by in vivo testing for an antigenic response using the 

methods described herein or those known in the art. Immunogenic epitopes predicted by algorithm 
analysis describe only amino acid residues comprising linear epitopes predicted to have the highest 
degree of immunogenicity. Polypeptides of the present invention that are not specifically 
described as immunogenic are not considered non-antigenic as they may be antigenic in vivo. 

30 Alternatively, the polypeptides are most likely antigenic in vitro using methods such as phage 
display. 

Preferably, the epitope-containing polypeptide comprises a contiguous span of at least 6, 
preferably at least 8 to 10, more preferably 12, 15, 20, 25, 30, 35, 40, 50, 60, 75, 100, 125, 150, 
175, 200, 225, 250, 275, or 300 amino acids of a polypeptide of the present invention. 
35 Nonlinear epitopes comprise more than one noncontiguous polypeptide sequence of at 

least one amino acid each. Such epitopes result from noncontiguous polypeptides brought into 
proximity by secondary, tertiary, or quaternary structural features. Therefore, the present invention 
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encompasses isolated, purified, or recombinant polypeptides and fragments thereof which 
comprise a nonlinear epitope. Preferred polypeptides providing nonlinear epitopes are formed by a 
contiguous surface of natively folded protein and are thus at least 10 amino acids in length, further 
preferably 12, 15, 20, 25, 30, 35, 40, 50, 60, 75, 100, 125, 150, 175, 200, 225, 250, 275, or 300 
5 amino acids of a polypeptide of the present invention, to the extent that a contiguous span of these 
lengths is consistent with the lengths of said selected sequence. Further preferred polypeptides 
comprise full-length polypeptide sequences selected from the group consisting of the polypeptide 
sequences of the Sequence Listing. Additionally, nonlinear epitopes may be formed by synthetic 
peptides that mimic an antigenic site or contiguous surface normally presented on a protein in the 

10 native conformation. Therefore, preferred polypeptides providing nonlinear epitopes may be 

formed by synthetic proteins that comprise a combination of at least 5, 6, 7, 8, 9, 10, 12, 15, 20, 25, 
30, 35, 40, 50, 60, 75, 100, 125, 150, 175, 200, 225, 250, 275, or 300 amino acids. 

The epitope-bearing fragments of the present invention preferably comprise 6 to 50 amino 
acids (i.e. any integer between 6 and 50, inclusive) of a polypeptide of the present invention. Also, 

15 included in the present invention are antigenic fragments between the integers of 6 and the full 
length GENSET sequence of the sequence listing. All combinations of sequences between the 
integers of 6 and the full-length sequence of a GENSET polypeptide are included. The epitope- 
bearing fragments may be specified by either the number of contiguous amino acid residues (as a 
sub-genus) or by specific N-terminal and C-terminal positions (as species) as described above for 

20 the polypeptide fragments of the present invention. Any number of epitope-bearing fragments of 
the present invention may also be excluded in the same manner. 

Antigenic epitopes are useful, for example, to raise antibodies, including monoclonal 
antibodies that specifically bind the epitope (see, Wilson et a/., 1984; and Sutcliffe et al 9 (1983), 
Science. 219:660-666, which disclosures are hereby incorporated by reference in their entireties). 

25 The antibodies are then used in various techniques such as diagnostic and tissue/cell identification 
techniques, as described herein, and in purification methods such as immunoaffinity 
chromatography. 

Similarly, immunogenic epitopes can be used to induce antibodies according to methods 
well known in the art (see, Sutcliffe et al. y supra; Wilson et aL 9 supra; Chow et al, (1985), Proc. 

30 Natl. Acad. Sci. USA. 82:910-914; and Bittle et al, (1985), Virol. 66:2347-2354, which 

disclosures are hereby incorporated by reference in their entireties). A preferred immunogenic 
epitope includes the natural GENSET protein. The immunogenic epitopes may be presented 
together with a carrier protein, such as an albumin, to an animal system (such as rabbit or mouse) 
or, if it is long enough (at least about 25 amino acids), without a carrier. However, immunogenic 

35 epitopes comprising as few as 8 to 10 amino acids have been shown to be sufficient to raise 

antibodies capable of binding to, at the very least, linear epitopes in a denatured polypeptide (e.g., 
in Western blotting.). 
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Epitope-bearing polypeptides of the present invention are used to induce antibodies 
according to methods well known in the art including, but not limited to, in vivo immunization, in 
vitro immunization, and phage display methods (see, e.g, Sutcliffe, et al., supra; Wilson, et al., 
supra, and Bittle, et al, supra). If in vivo immunization is used, animals may be immunized with 
5 free peptide; however, anti-peptide antibody titer may be boosted by coupling of the peptide to a 
macromolecular carrier, such as keyhole limpet hemacyanin (KLH) or tetanus toxoid. For 
instance, peptides containing cysteine residues may be coupled to a carrier using a linker such as 
-maleimidobenzoyl- N-hycfroxysuccinimide ester (MBS), while other peptides may be coupled to 
carriers using a more general linking agent such as glutaraldehyde. Animals such as rabbits, rats 

1 0 and mice are immunized with either free or carrier-coupled peptides, for instance, by 

intraperitoneal and/or intradermal injection of emulsions containing about 100 jugs of peptide or 
carrier protein and Freund's adjuvant. Several booster injections may be needed, for instance, at 
intervals of about two weeks, to provide a useful titer of anti-peptide antibody, which can be 
detected, for example, by ELISA assay using free peptide adsorbed to a solid surface. The titer of 

1 5 anti-peptide antibodies in serum from an immunized animal may be increased by selection of 
anti-peptide antibodies, for instance, by adsorption to the peptide on a solid support and elution of 
the selected antibodies according to methods well known in the art. 

As one of skill in the art will appreciate, and discussed above, the polypeptides of the 
present invention comprising an immunogenic or antigenic epitope can be fused to heterologous • 

20 polypeptide sequences. For example, the polypeptides of the present invention may be fused with 
the constant domain of immunoglobulins (IgA, IgE, IgG, IgM), or portions thereof (CHI, CH2, 
CH3, any combination thereof including both entire domains and portions thereof) resulting in 
chimeric polypeptides. These fusion proteins facilitate purification, and show an increased 
half-life in vivo. This has been shown, e.g., for chimeric proteins consisting of the first two 

25 domains of the human CD4-polypeptide and various domains of the constant regions of the heavy 
or light chains of mammalian immunoglobulins [see, e.g, EPA 0,394,827; and Traunecker et al, 
(1988), Nature. 331:84-86, which disclosures are hereby incorporated by reference in their 
entireties]. Fusion proteins that have a disulfide-linked dimeric structure due to the IgG portion 
can also be more efficient in binding and neutralizing other molecules than monomeric 

30 polypeptides or fragments thereof alone [see, e.g., Fountoulakis et al, (1995) Biochem. 270:3958- 
3964, which disclosure is hereby incorporated by reference in its entirety]. Nucleic acids encoding 
the above epitopes can also be recombined with a gene of interest as an epitope tag to aid in 
detection and purification of the expressed polypeptide. 

Additional fusion proteins of the invention may be generated through the techniques of 

3 5 gene-shuffling, motif-shuffling, exon-shuffling, or codon-shuffling (collectively referred to as 
"DNA shuffling"). DNA shuffling may be employed to modulate the activities of polypeptides of 
the present invention thereby effectively generating agonists and antagonists of the polypeptides. 
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See, for example, U.S. Patent Nos.: 5,605,793; 5,81 1,238; 5,834,252; 5,837,458; and Patten, et al 
(1997), Curr Opinion Biotechnol. 8:724-733; Harayama (1998), Trends Biotechnol. 16(2): 76-82; 
Hansson et al 9 (1999), J. Mol. Biol. 287:265-276; and Lorenzo and Blasco (1998) Biotechniques. 
24(2):308-313. (Each of these documents are hereby incorporated by reference). In one 
5 embodiment, one or more components, motifs, sections, parts, domains, fragments, etc., of coding 
polynucleotides of the invention, or the polypeptides encoded thereby may be recombined with one 
or more components, motifs, sections, parts, domains, fragments, etc. of one or more heterologous 
molecules. 

The present invention further encompasses any combination of the polypeptide fragments 
10 listed in this section. 
Antibodies 
Definitions 

The present invention further relates to antibodies and T-cell antigen receptors (TCR), 
which specifically bind the polypeptides, and more specifically, the epitopes of the polypeptides of 

15 the present invention. The antibodies of the present invention include IgG (including IgGl , IgG2, 
IgG3, and IgG4) a IgA (including IgAl and IgA2), IgD, IgE, or IgM, and IgY. The term "antibody" 
(Ab) refers to a polypeptide or group of polypeptides which are comprised of at least one binding 
domain, where a binding domain is formed from the folding of variable domains of an antibody 
molecule to form three-dimensional binding spaces with an internal surface shape and charge 

20 distribution complementary to the features of an antigenic determinant of an antigen, which allows 
an immunological reaction with the antigen. As used herein, the term "antibody" is meant to 
include whole antibodies, including single-chain whole antibodies, and antigen binding fragments 
thereof. In a preferred embodiment the antibodies are human antigen binding antibody fragments 
of the present invention include, but are not limited to, Fab, Fab 1 F(ab)2 and F(ab02, Fd, single- 

25 chain Fvs (scFv), single-chain antibodies, disulfide-linked Fvs (sdEv) and fragments comprising 
either a V L or V H domain. The antibodies may be from any animal origin including birds and 
mammals. Preferably, the antibodies are human, murine, rabbit, goat, guinea pig, camel, horse, or 
chicken. 

Antigen-binding antibody fragments, including single-chain antibodies, may comprise the 
30 variable region(s) alone or in combination with the entire or partial of the following: hinge region, 
CHI, CH2, and CH3 domains. Also included in the invention are any combinations of variable 
region(s) and hinge region, CHI, CH2, and CH3 domains. The present invention further includes 
chimeric, humanized, and human monoclonal and polyclonal antibodies, which specifically bind 
the polypeptides of the present invention. The present invention further includes antibodies that 
35 are anti-idiotypic to the antibodies of the present invention. 

The antibodies of the present invention may be monospecific, bispecific, and trispecific or 
have greater multispecificity. Multispecific antibodies may be specific for different epitopes of a 
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polypeptide of the present invention or may be specific for both a polypeptide of the present 
invention as well as for heterologous compositions, such as a heterologous polypeptide or solid 
support material. See, e.g., WO 93/17715; WO 92/08802; WO 91/00360; WO 92/05793; Tutt, et 
aU (1991), J. Immunol. 147:60-69; US Patents 5,573,920, 4,474,893, 5,601,819, 4,714,681, 
5 4,925,648; Kostelny et aL, (1992), J. Immunol. 148:1547-1553, which disclosures are hereby 
incorporated by reference in their entireties. 

Antibodies of the present invention may be described or specified in terms of the 
epitope(s) or epitope-bearing portion(s) of a polypeptide of the present invention, which are 
recognized or specifically bound by the antibody. The antibodies may specifically bind a complete 

1 0 protein encoded by a nucleic acid of the present invention, or a fragment thereof. Therefore, the 
epitope(s) or epitope bearing polypeptide portion(s) may be specified as described herein, e.g., by 
N-terminal and C-terminal positions, by size in contiguous amino acid residues, or otherwise 
described herein (including the sequence listing). Antibodies which specifically bind any epitope 
or polypeptide of the present invention may also be excluded as individual species. Therefore, the 

15 present invention includes antibodies that specifically bind specified polypeptides of the present 
invention, and allows for the exclusion of the same. 

Thus, another embodiment of the present invention is a purified or isolated antibody 
capable of specifically binding to a polypeptide of the present invention. In one aspect of this 
embodiment, the antibody is capable of binding to a linear epitope-containing polypeptide 

20 comprising at least 6 consecutive amino acids, preferably at least 8 to 10 consecutive amino acids, 
more preferably at least 12, 15, 20, 25, 30, 40, 50, or 100 consecutive amino acids of a 
polypeptides of the present invention. In another aspect of this embodiment, the antibody is 
capable of binding to a nonlinear epitope-containing polypeptide comprising 10 amino acids in 
length, further preferably 12, 15, 20, 25, 30, 35, 40, 50, 60, 75, or 100 amino acids, further 

25 preferably, a contiguous surface of the native conformation of a polypeptide of the present 

application. Additionally, the antibody is capable of binding a nonlinear epitope presented by a 
synthetic peptide designed to mimic a contiguous surface of the native conformation of a 
polypeptide of a sequence selected from the group consisting of GENSET polypeptides. 
Antibodies that bind linear epitopes may be used in combination with antibodies that bind 

30 nonlinear epitopes for instance, in assays that detect proper protein folding. 

Antibodies of the present invention may also be described or specified in terms of their 
cross-reactivity. Antibodies that do not specifically bind any other analog, ortholog, or homologue 
of the polypeptides of the present invention are included. Antibodies that do not bind polypeptides 
with less than 95%, less than 90%, less than 85%, less than 80%, less than 75%, less than 70%, 

35 less than 65%, less than 60%, less than 55%, and less than 50% identity (as calculated using 
methods known in the art and described herein, e.g., using FASTDB and the parameters set forth 
herein) to a polypeptide of the present invention are also included in the present invention. Further 
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included in the present invention are antibodies, which only bind polypeptides encoded by 
polynucleotides, which hybridize to a polynucleotide of the present invention under stringent 
hybridization conditions (as described herein). Antibodies of the present invention may also be 
described or specified in terms of their binding affinity. Preferred binding affinities include those 
5 with a dissociation constant or Kd less than 5X10"^, 10"^, 5X10" 7 M, 10" 7 M, 5X10" 8 M, 1(T 8 M, 
5X1G*M, 1<T*M, 5X10- 10 M, 10- IO M,.5Xl(r n M, l(r n M, 5X10" ,2 M, 10" 12 M, 5X10" ,3 M, 10' ,3 M, 
5X10- 14 M, 10' I4 M, 5X10- 15 M, and 10' I5 M. 

The invention also concerns a purified or isolated antibody capable of specifically binding 
to a mutated GENSET protein or to a fragment or variant thereof comprising an epitope of the 

10 mutated GENSET protein. 
Preparation of antibodies 

The antibodies of the present invention may be prepared by any suitable method known in 
the art. Some of these methods are described in more detail in the example entitled "Example 1 : 
Preparation of Antibody Compositions to the GENSET protein". For example, a polypeptide of 

15 the present invention or an antigenic fragment thereof can be administered to an animal in order to 
induce the production of sera containing "polyclonal antibodies". As used herein, the term 
"monoclonal antibody" is not limited to antibodies produced through hybridoma technology but it 
rather refers to an antibody, that is derived from a single clone, including eukaryotic, prokaryotic, 
or phage clone, and not the method by which it is produced. Monoclonal antibodies can be 

20 prepared using a wide variety of techniques known in the art including the use of hybridoma, 
recombinant, and phage display technology. 

Hybridoma techniques include those known in the art [see, e.g., Harlow and Lane, (1988) 
Antibodies A Laboratory Manual. Cold Spring Harbor Laboratory, pp. 53-242; Hammerling 
(1981), Monoclonal Antibodies and T-Cell Hybridomas, Elsevier, N.Y. 563-681; said references 

25 incorporated by reference in their entireties] . Fab and F(ab')2 fragments may be produced, for 
example, from hybridoma-produced antibodies by proteolytic cleavage, using enzymes such as 
papain (to produce Fab fragments) or pepsin (to produce F(ab*)2 fragments). 

Alternatively, antibodies of the present invention can be produced through the application 
of recombinant DNA technology or through synthetic chemistry using methods known in the art. 

30 For example, the antibodies of the present invention can be prepared using various phage display 
methods known in the art. In phage display methods, functional antibody domains are displayed 
on the surface of a phage particle, which carries polynucleotide sequences encoding them. Phage 
with a desired binding property are selected from a repertoire or combinatorial antibody library 
(e.g. human or murine) by selecting directly with antigen, typically antigen bound or captured to a 

35 solid surface or bead. Phage used in these methods are typically filamentous phage including fd 
and M13 with Fab, Fv or disulfide stabilized Fv antibody domains recombinant^ fused to either 
the phage gene HI or gene VIII protein. Examples of phage display methods that can be used to 
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make the antibodies of the present invention include those disclosed in Brinkman et al, (1995j 
J. Immunol Methods, 182:41-50; Ames et al, (1995), J. Immunol. Meth., 184:177-186.; 
Kettleborough et al 9 (1994), Eur. L Immunol., 24:952-958; Persic et al, (1997), Gene, 1879-81; 
Burton etal (1994), Adv. Immunol., 57:191-280; PCT/GB9 1/0 1134; WO 90/02809; WO 
5 91/10737; WO 92/01047; WO 92/18619; WO 93/11236; WO 95/15982; WO 95/20401; and US 
Patents 5,698,426, 5,223,409, 5,403,484, 5,580,717, 5,427,908, 5,750,753, 5,821,047, 5,571,698, 
5,427,908, 5,516,637, 5,780,225, 5,658,727 and 5,733,743 (said references incorporated by 
reference in their entireties). 

As described in the above references, after phage selection, the antibody coding regions 

10 from the phage can be isolated and used to generate whole antibodies, including human antibodies, 
or any other desired antigen binding fragment, and expressed in any desired host including 
mammalian cells, insect cells, plant cells, yeast, and bacteria. For example, techniques to 
recombinantly produce Fab, Fab' F(ab)2 and F(ab r )2 fragments can also be employed using 
methods known in the art such as those disclosed in WO 92/22324; Mullinax et al, (1992), 

15 BioTechniques. 12(6):864-869; and Sawai et al, (1995), AJRI 34:26-34; and Better et al, (1988), 
Science. 240:1041-1043 (said references incorporated by reference in their entireties). 

Examples of techniques which can be used to produce single-chain Fvs and antibodies 
include those described in U.S. Patents 4,946,778 and 5,258,498; Huston et al, (1991), Meth. 
Enymol. 203:46_88; Shu, et al, (1993), Proc. Natl. Acad. Sci. U.S.A. 90:7995-7999; and Skerra, et 

20 al, (1988), Science 240:1038-1040, which disclosures are hereby incorporated by reference in 
their entireties. For some uses, including in vivo use of antibodies in humans and in vitro detection 
assays, it may be preferable to use chimeric, humanized, or human antibodies. Methods for 
producing chimeric antibodies are known in the art. See e.g., Morrison, (1985); Oi et al, (1986), 
BioTechniques 4:214; Gillies et al, (1989), J. Immunol Methods. 125:191-202; and US Patent 

25 5,807,715, which disclosures are hereby incorporated by reference in their entireties. Antibodies 
can be humanized using a variety of techniques including CDR-grafting (EP 0 239 400; WO 
91/09967; US Patent 5,530,101; and 5,585,089), veneering or resurfacing [EP 0 592 106; EP 0 519 
596; Padlan (1991), Molec. Immunol. 28(4/5) :489-498; Studnicka etal, (1994), Protein 
Engineering. 7(6):805-814; Roguska et al, (1994), Proc. Natl. Acad. Sci. U.S.A. 91:969-973], and 

30 chain shuffling (US Patent 5,565,332), which disclosures are hereby incorporated by reference in 
their entireties. Human antibodies can be made by a variety of methods known in the art including 
phage display methods described above. See also, US Patents 4,444,887, 4,716,1 1 1, 5,545,806, 
and 5,814,318; WO 98/46645; WO 98/50433; WO 98/24893; WO 96/34096; WO 96/33735; and 
WO 91/10741 (said references incorporated by reference in their entireties). 

35 Further included in the present invention are antibodies recombinantly fused or chemically 

conjugated (including both covalent and non-covalent conjugations) to a polypeptide of the present 
invention. The antibodies may be specific for antigens other than polypeptides of the present 
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invention. For example, antibodies of the present invention may be recombinant^ fused or 
conjugated to molecules useful as labels in detection assays and effector molecules such as 
heterologous polypeptides, drugs, or toxins. See, e.g., WO 92/08495; WO 91/14438; WO 
89/12624; US Patent 5,314,995; and EP 0 396 387, which disclosures are hereby incorporated by 
5 reference in their entireties. Fused antibodies may also be used to target the polypeptides of the 
present invention to particular cell types, either in vitro or in vivo, by fusing or conjugating the 
polypeptides of the present invention to antibodies specific for particular cell surface receptors. 
Antibodies fused or conjugated to the polypeptides of the present invention may also be used in 
vitro immunoassays and purification methods using methods known in the art [see e.g., Harbor, et 

10 al. supra; WO 93/21232; EP 0 439 095; Naramura et al 9 (1994), Immunol. Lett. 39:91-99; US 
Patent 5,474,981; Gillies et aL, (1992), Proc Natl Acad Sci U S A 89:1428-1432; Fell et aL, 
(1991), J. Immunol. 146:2446-2452; said references incorporated by reference in their entireties]. 

The present invention further includes compositions comprising the polypeptides of the 
present invention fused or conjugated to antibody domains other than the variable regions. For 

15 example, the polypeptides of the present invention may be fused or conjugated to an antibody Fc 
region, or portion thereof. The antibody portion fused to a polypeptide of the present invention 
may comprise the hinge region, CHI domain, CH2 domain, and CH3 domain or any combination 
of whole domains or portions thereof. The polypeptides of the present invention may be fused or 
conjugated to the above antibody portions to increase the in vivo half-life of the polypeptides or for 

20 use in immunoassays using methods known in the art. The polypeptides may also be fused or 
conjugated to the above antibody portions to form multimers. For example, Fc portions fused to 
the polypeptides of the present invention can form dimers through disulfide bonding between the 
Fc portions. Higher multimeric forms can be made by fusing the polypeptides to portions of IgA 
and IgM. Methods for fusing or conjugating the polypeptides of the present invention to antibody 

25 portions are known in the art. See e.g., US Patents 5,336,603, 5,622,929, 5,359,046, 5,349,053, 
5,447,851, 5,1 12,946; EP 0 307 434, EP 0 367 166; WO 96/04388, WO 91/06570; Ashkenazi et 
aL, (1991), Proc. Natl. Acad. Sci. USA 88:10535-10539; Zheng, XX, et aL (1995), J. Immunol. 
154:5590-5600; and Vil, et aL (1992), Proc Natl Acad Sci U S 89:1 1337-1 1341 (said references 
incorporated by reference in their entireties). 

30 Non-human animals or mammals, whether wild-type or transgenic, which express a 

different species of GENSET than the one to which antibody binding is desired, and animals which 
do not express GENSET (i.e. a GENSET knock out animal as described herein) are particularly 
usefid for preparing antibodies. GENSET knock out animals will recognize all or most of the 
exposed regions of a GENSET protein as foreign antigens, and therefore produce antibodies with a 

35 wider array of GENSET epitopes. Moreover, smaller polypeptides with only 10 to 30 amino acids 
may be useful in obtaining specific binding to any one of the GENSET proteins. In addition, the 
humoral immune system of animals which produce a species of GENSET that resembles the 
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antigenic sequence will preferentially recognize the differences between the animal's native 
GENSET species and the antigen sequence, and produce antibodies to these unique sites in the 
antigen sequence. Such a technique will be particularly useful in obtaining antibodies that 
specifically bind to any one of the GENSET proteins. 
5 A preferred embodiment of the invention is a method of specifically binding an antibody 

or antibody fragment to a GENSET polypeptide. This method comprises the step of contacting a 
GENSET polypeptide-specific antibody or fragment thereof with a GENSET polypeptide under 
antibody-binding conditions. Further included is a method of specifically binding an antibody or 
antibody fragment to an epitope, domain, or fragment of a GENSET polypeptide. This method 
1 0 may be used to, for example, detect, purify, or modify the activity of GENSET polypeptides, as 
disussed herein. 

Antibodies of the invention can be used to assay protein levels in a test sample or 
biological sample using methods known to those of skill in the art. Antibody-based methods 
useful for detecting protein include immunoassays, such as the enzyme linked immunosorbent 

15 assay (ELISA) and radioimmunoassay (RIA). Suitable antibody assay labels are known in the art 
and include enzyme labels, such as glucose oxidase, horseradish peroxidase, and alkaline 
phosphatase; radioisotopes, such as iodine (1251, 1211), carbon (14C), sulfur (35S), tritium (3H), 
indium (121In), and technetium (99Tc); luminescent labels, such luminol, isolumino, theromatic 
acridinium ester, imidazole, acridinium salt, oxalate ester, luciferin, luciferase, and aequorin; and 

20 fluorescent labels, such as fluorescein isothiocyanate, rhodamine, phycoerythrin, phycocyanin, 
allophycocyanin, o-phthaldehyde, and fluorescamine. 

Uses of polynucleotides 
Uses of polynucleotides as reagents 

The polynucleotides of the present invention may be used as reagents in isolation 

25 procedures, diagnostic assays, and forensic procedures. For example, sequences from the 

GENSET polynucleotides of the invention may be detectably labeled and used as probes to isolate 
other sequences capable of hybridizing to them. In addition, sequences from the GENSET 
polynucleotides of the invention may be used to design PCR primers to be used in isolation, 
diagnostic, or forensic procedures. 

30 To find corresponding genomic DNA sequences 

The GENSET cDNAs of the invention may also be used to clone sequences located 
upstream of the cDNAs of the invention on the corresponding genomic DNA. Such upstream 
sequences may be capable of regulating gene expression, including promoter sequences, enhancer 
sequences, and other upstream sequences which influence transcription or translation levels. Once 

35 identified and cloned, these upstream regulatory sequences may be used in expression vectors 
designed to direct the expression of an inserted gene in a desired spatial, temporal, developmental, 
or quantitative fashion. 
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Use of cDNAs or Fragments thereof to Clone Upstream Sequences from Genomic DNA 

Sequences derived from polynucleotides of the inventions may be used to isolate (he 
promoters of the corresponding genes using chromosome walking techniques. In one chromosome 
walking technique, the GenomeWalker™ kit available from Clontechis used according to the 
5 manufacturer's instructions. 

Identification of Promoters in Cloned Upstream Sequences 

Once the upstream genomic sequences have been cloned and sequenced, prospective 
promoters and transcription start sites within the upstream sequences may be identified by 
comparing the sequences upstream of the polynucleotides of the inventions with databases 

10 containing known transcription start sites, transcription factor binding sites, or promoter sequences. 
In addition, promoters in the upstream sequences may be identified using promoter 
reporter vectors as follows. The expression of the reporter gene will be detected when placed 
under the control of regulatory active polynucleotide fragments or variants of the GENSET 
promoter region located upstream of the first exon of the GENSET gene. Suitable promoter 

1 5 reporter vectors, into which the GENSET promoter sequences may be cloned include pSEAP- 
Basic, pSEAP-Enhancer, ppgal-Basic, ppgal-Enhancer, or pEGFP-1 Promoter Reporter vectors 
available from Clontech, or pGL2-basic or pGL3-basic promoterless luciferase reporter gene 
vector from Promega. Briefly, each of these promoter reporter vectors include multiple cloning 
sites positioned upstream of a reporter gene encoding a readily assayable protein such as secreted 

20 alkaline phosphatase, luciferase, beta-galactosidase, or green fluorescent protein. The sequences 
upstream the GENSET coding region are inserted into the cloning sites upstream of the reporter 
gene in both orientations and introduced into an appropriate host cell. The level of reporter protein 
is assayed and compared to the level obtained from a vector which lacks an insert in the cloning 
site. The presence of an elevated expression level in the vector containing the insert with respect 

25 to the control vector indicates the presence of a promoter in the insert If necessary, the upstream 
sequences can be cloned into vectors which contain an enhancer for increasing transcription levels 
from weak promoter sequences. A significant level of expression above that observed with the 
vector lacking an insert indicates that a promoter sequence is present in the inserted upstream 
sequence. Promoter sequence within the upstream genomic DNA may be further defined by site 

30 directed mutagenesis, linker scanning analysis, or other techniques familiar to those skilled in the 
art. 

The strength and the specificity of the promoter of each GENSET gene can be assessed 
through the expression levels of a detectable polynucleotide operably linked to the GENSET 
promoter in different types of cells and tissues. The detectable polynucleotide may be either a 
35 polynucleotide that specifically hybridizes with a predefined oligonucleotide probe, or a 

polynucleotide encoding a detectable protein, including a GENSET polypeptide or a fragment or a 
variant thereof. This type of assay is well known to those skilled in the art and is described in US 
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Patent No. 5,502,176; and US Patent No. 5,266,488; the disclosures of which are incorporated by 
reference herein in their entirety. Some of the methods are discussed in more detail elsewhere in 
the application. 

The promoters and other regulatory sequences located upstream of the polynucleotides of 
5 the inventions may be used to design expression vectors capable of directing the expression of an 
inserted gene in a desired spatial, temporal, developmental, or quantitative manner. A promoter 
capable of directing the desired spatial, temporal, developmental, and quantitative patterns may be 
selected using the results of the expression analysis described herein. For example, if a promoter 
which confers a high level of expression in muscle is desired, the promoter sequence upstream of a 

10 polynucleotide of the invention derived from an mRNA which is expressed at a high level in 
muscle may be used in the expression vector. 
To find similar sequences 

Polynucleotides of the invention may be used to isolate and/or purify nucleic acids similar 
thereto using any methods well known to those skilled in the art including the techniques based on 

1 5 hybridization or on amplification described in this section. These methods may be used to obtain 
the genomic DNAs which encode the mRNAs from which the GENSET cDNAs are derived, 
mRNAs corresponding to GENSET cDNAs, or nucleic acids which are homologous to GENSET 
cDNAs or fragments thereof, such as variants, species homologues or orthologs. 
Hybridization-based methods 

20 Techniques for identifying cDNA clones in a cDNA library which hybridize to a given 

probe sequence are disclosed in Sambrook et al, (1989) Molecular Cloning: A Laboratory 
Manual. (2ed., Cold Spring Harbor Laboratory, Cold Spring Harbor, New York), and in Hames 
and Higgins (1985) Nucleic Acid Hybridization: A Practical Approach (Hames and Higgins Ed., 
IRL Press, Oxford), the disclosures of which are incorporated herein by reference in their 

25 entireties. The same techniques may be used to isolate genomic DNAs. 

A probe comprising at least 10 consecutive nucleotides from a GENSET cDNA or 
fragment thereof is labeled with a detectable label such as a radioisotope or a fluorescent molecule. 

Techniques for labeling the probe are well known and include phosphorylation with 
polynucleotide kinase, nick translation, in vitro transcription, and non radioactive techniques. The 

30 cDNAs or genomic DNAs in the libraiy are transferred to a nitrocellulose or nylon filter and 

denatured. After blocking of nonspecific sites, the filter is incubated with the labeled probe for an 
amount of time sufficient to allow binding of the probe to cDNAs or genomic DNAs containing a 
sequence capable of hybridizing thereto. 

By varying the stringency of the hybridization conditions used to identify cDNAs or 

35 genomic DNAs which hybridize to the detectable probe, cDNAs or genomic DNAs having 
different levels of identity to the probe can be identified and isolated as described below. 
Stringent conditions 
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"Stringent hybridization conditions" are defined as conditions in which only nucleic acids 
having a high level of identity to the probe are able to hybridize to said probe. These conditions 
may be calculated as follows: 

For probes between 14 and 70 nucleotides in length the melting temperature (Tm) is 
5 calculated using the formula: Tm=81.5+16.6(log (Na+))4041(fraction GH<:H600/N) whereNis 
the length of the probe. 

If the hybridization is carried out in a solution containing formamide, the melting 
temperature may be calculated using the equation: Tm=81.5+16.6(log (Na+))+0.41 (fraction 
G+C)-(0.63% formamide>(600/N) where N is the length of the probe. 
10 Prehybridization may be carried out in 6X SSC, 5X Denhardt's reagent, 0.5% SDS, 100 \ig 

denatured fragmented salmon sperm DNA or 6X SSC, 5X Denhardt's reagent, 0.5% SDS, 100 \ig 
denatured fragmented salmon sperm DNA, 50% formamide. The formulas for SSC and 
Denhardt's solutions are listed in Sambrook et aL, 1986. 

Hybridization is conducted by adding the detectable probe to the prehybridization solutions 
15 listed above. Where the probe comprises double stranded DNA, it is denatured before addition to 
the hybridization solution. The filter is contacted with the hybridization solution for a sufficient 
period of time to allow the probe to hybridize to nucleic acids containing sequences 
complementary thereto or homologous thereto. For probes over 200 nucleotides in length, the 
hybridization may be carried out at 15-25°C below the Tm. For shorter probes, such as 
20 oligonucleotide probes, the hybridization may be conducted at 1 5-25°C below the Tm. Preferably, 
for hybridizations in 6X SSC, the hybridization is conducted at approximately 68°C. Preferably, 
for hybridizations in 50% formamide containing solutions, the hybridization is conducted at 
approximately 42°C. 

Following hybridization, the filter is washed in 2X SSC, 0.1% SDS at room temperature 
25 for 15 minutes. The filter is then washed with 0. IX SSC, 0.5% SDS at room temperature for 30 
minutes to 1 hour. Thereafter, the solution is washed at the hybridization temperature in 0.1X 
SSC, 0.5% SDS. A final wash is conducted in 0.1X SSC at room temperature. 

Nucleic acids which have hybridized to the probe are identified by autoradiography or 
other conventional techniques. 
30 Low and moderate conditions 

Changes in the stringency of hybridization and signal detection are primarily accomplished 
through the manipulation of formamide concentration (lower percentages of formamide result in 
lowered stringency); salt conditions, or temperature. The above procedure may thus be modified 
to identify nucleic acids having decreasing levels of identity to the probe sequence. For example, 
35 the hybridization temperature may be decreased in increments of 5°C from 68°C to 42°C in a 
hybridization buffer having a sodium concentration of approximately 1M. Following 
hybridization, the filter may be washed with 2X SSC, 0.5% SDS at the temperature of 
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hybridization. These conditions are considered to be "moderate" conditions above 50°C and "low" 
conditions below 50°C. Alternatively, the hybridization may be carried out in buffers, such as 6X 
SSC, containing formamide at a temperature of 42°C. In this case, the concentration of formamide 
in the hybridization buffer may be reduced in 5% increments from 50% to 0% to identify clones 
5 having decreasing levels of identity to the probe. Following hybridization, the filter may be 
washed with 6X SSC, 0.5% SDS at 50°C. These conditions are considered to be "moderate" 
conditions above 25% formamide and "low" conditions below 25% formamide. cDNAs or 
genomic DNAs which have hybridized to the probe are identified by autoradiography or other 
conventional techniques. 

1 0 Note that variations in the above conditions may be accomplished through the inclusion 

and/or substitution of alternate blocking reagents used to suppress background in hybridization 
experiments. Typical blocking reagents include Denhardt's reagent, BLOTTO, heparin, denatured 
salmon sperm DNA, and commercially available proprietary formulations. The inclusion of 
specific blocking reagents may require modification of the hybridization conditions described 

1 5 above, due to problems with compatibility. 

Consequently, the present invention encompasses methods of isolating nucleic acids 
similar to the polynucleotides of the invention, comprising the steps of: 

a) contacting a collection of cDNA or genomic DNA molecules with a detectable 
probe comprising at least 12, 15, 1 8, 20, 23, 25, 28, 30, 35, 40 or 50 consecutive 

20 nucleotides of a polynucleotide of the present invention under stringent, moderate 

or low conditions which permit said probe to hybridize to at least a cDNA or 
genomic DNA molecule in said collection; 

b) identifying said cDNA or genomic DNA molecule which hybridizes to said 
detectable probe; and 

25 c) isolating said cDNA or genomic DNA molecule which hybridized to said probe. 
PCR-based methods 

In addition to the above described methods, other protocols are available to obtain 
homologous cDNAs using GENSET cDNA of the present invention or fragment thereof as 
outlined in the following paragraphs. 
30 cDNAs may be prepared by obtaining mRNA from the tissue, cell, or organism of interest 

using mRNA preparation procedures utilizing polyA selection procedures or other techniques 
known to those skilled in the art. A first primer capable of hybridizing to the polyA tail of the 
mRNA is hybridized to the mRNA and a reverse transcription reaction is performed to generate a 
first cDNA strand. 

35 The term "capable of hybridizing to the polyA tail of said mRNA" refers to and embraces 

all primers containing stretches of thymidine residues, so-called oligo(dT) primers, that hybridize 
to the 3* end of eukaryotic poly(A)+ mRNAs to prime the synthesis of a first cDNA strand. 
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Techniques for generating said oligo (dT) primers and hybridizing them to mRNA to subsequently 
prime the reverse transcription of said hybridized mRNA to generate a first cDNA strand are well 
known to those skilled in the art and are described in Current Protocols in Molecular Biology, John 
Wiley and Sons, Inc. 1997 and Sambrook, et a/., 1989. Preferably, said oligo (dT) primers are 

5 present in a large excess in order to allow the hybridization of all mRNA 3 'ends to at least one 
oligo (dT) molecule. The priming and reverse transcription steps are preferably performed 
between 37°C and 55°C depending on the type of reverse transcriptase used. Preferred oligo(dT) 
primers for priming reverse transcription of mRNAs are oligonucleotides containing a stretch of 
thymidine residues of sufficient length to hybridize specifically to the polyA tail of mRNAs, 

10 preferably of 12 to 18 thymidine residues in length. More preferably, such oligo(T) primers 

comprise an additional sequence upstream of the poly(dT) stretch in order to allow the addition of 
a given sequence to the 5 'end of all first cDNA strands which may then be used to facilitate 
subsequent manipulation of the cDNA. Preferably, this added sequence is 8 to 60 residues in 
length. For instance, the addition of a restriction site in 5' of cDNAs facilitates subcloning of the 

15 obtained cDNA. Alternatively, such an added 5' end may also be used to design primers of PCR 
to specifically amplify cDNA clones of interest. 

The first cDNA strand is then hybridized to a second primer. Any appropriate 
polynucleotide fragment of the invention may be used. This second primer contains at least 10 
consecutive nucleotides of a polynucleotide of the invention. Preferably, the primer comprises at 

20 least 10, 12, 15, 17, 18, 20, 23, 25, or 28 consecutive nucleotides of a polynucleotide of the 
invention. In some embodiments, the primer comprises more than 30 nucleotides of a 
polynucleotide of the invention. If it is desired to obtain cDNAs containing the full protein coding 
sequence, including the authentic translation initiation site, the second primer used contains 
sequences located upstream of the translation initiation site. The second primer is extended to 

25 generate a second cDNA strand complementary to the first cDNA strand. Alternatively, RT-PCR . 
may be performed as described above using primers from both ends of the cDNA to be obtained. 

The double stranded cDNAs made using the methods described above are isolated and 
cloned. The cDNAs may be cloned into vectors such as plasmids or viral vectors capable of 
replicating in an appropriate host cell. For example, the host cell may be a bacterial, mammalian, 

30 avian, or insect cell. 

Techniques for isolating mRNA, reverse transcribing a primer hybridized to mRNA to 
generate a first cDNA strand, extending a primer to make a second cDNA strand complementary to 
the first cDNA strand, isolating the double stranded cDNA and cloning the double stranded cDNA 
are well known to those skilled in the art and are described in Current Protocols in Molecular 

35 Biology, John Wiley & Sons, Inc. 1997 and Sambrook, et al, 1989. 

Consequently, the present invention encompasses methods of making cDNAs. In a first 
embodiment, the method of making a cDNA comprises the steps of: 
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a) contacting a collection of mRNA molecules from human cells with a primer 
comprising at least 12, 15, 18, 20, 23, 25, 28, 30, 35, 40, or 50 consecutive 
nucleotides of a sequence selected from the group consisting of the polynucleotide 
sequences complementary to the polynucleotide sequences of the Sequence Listing 

5 and those complementary to a human cDNA clone insert of the deposited clone 

pool; 

b) hybridizing said primer to an mRNA in said collection; 

c) reverse transcribing said hybridized primer to make a first cDNA strand from said 
mRNA; 

10 . d) . making a second cDNA strand complementary to said first cDNA strand; and 

e) isolating the resulting cDNA comprising said first cDNA strand and said second 
cDNA strand. 

Another embodiment of the present invention is a purified cDNA obtainable by the method 
of the preceding paragraph. In one aspect of this embodiment, the cDNA encodes at least a portion 
15 of a human polypeptide. 

In a second embodiment, the method of making a cDNA comprises the steps of: 

a) contacting a collection of mRNA molecules from human cells with a first primer 
capable of hybridizing to the polyA tail of said mRNA; 

b) hybridizing said first primer to said polyA tail; 

20 c) reverse transcribing said mRNA to make a first cDNA strand; 

d) making a second cDNA strand complementary to said first cDNA strand using at 
least one primer comprising at least 12, 15, 18, 20, 23, 25, 28, 30, 35, 40, or 50 
consecutive nucleotides of a sequence selected from the group consisting of 
polynucleotide sequences of the Sequence Listing and those of human cDNA 

25 clone inserts of the deposited clone pool; and 

e) isolating the resulting cDNA comprising said first cDNA strand and said second 
cDNA strand. 

In another aspect of this method the second cDNA strand is made by: 

a) contacting said first cDNA strand with a second primer comprising at least 12, 15, 
30 1 8, 20, 23, 25, 28, 30, 35, 40, or 50 consecutive nucleotides of a sequence selected 

from the group consisting of polynucleotide sequences of the Sequence Listing 
and those of human cDNA clone inserts of the deposited clone pool, and a third 
primer which sequence is fully included within the sequence of said first primer; 

b) performing a first polymerase chain reaction with said second and third primers to 
35 generate a first PCR product; 

c) contacting said first PCR product with a fourth primer, comprising at least 12, 15, 
18, 20, 23, 25, 28, 30, 35, 40, or 50 consecutive nucleotides of said sequence 
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selected from the group consisting of polynucleotide sequences of the Sequence 
Listing and those of human cDNA clone inserts of the deposited clone pool, and a 
fifth primer, which sequence is fully included within the sequence of said third 
primer, wherein said fourth and fifth hybridize to sequences within said first PGR 
5 product; and 

d) performing a second polymerase chain reaction, thereby generating a second PCR 
product. 

Alternatively, the second cDNA strand may be made by contacting said first cDNA strand 
with a second primer comprising at least 12, 15, 18, 20, 23, 25, 28, 30, 35, 40, or 50 consecutive 
10 nucleotides of a sequence selected from the group consisting of polynucleotide sequences of the 
Sequence Listing and human cDNA clone inserts of the deposited clone pool, and a third primer 
which sequence is fully included within the sequence of said first primer and performing a 
polymerase chain reaction with said second and third primers to generate said second cDNA 
strand. 

1 5 Alternatively, the second cDNA strand may be made by: 

a) contacting said first cDNA strand with a second primer comprising at least 12, 15, 
18, 20, 23, 25, 28, 30, 35, 40, or 50 consecutive nucleotides of a sequence selected 
from the group consisting of polynucleotide sequences of the Sequence Listing 
and human cDNA clone inserts of the deposited clone pool; 
20 b) hybridizing said second primer to said first strand cDNA; and 

c) extending said hybridized second primer to generate said second cDNA strand 
Another embodiment of the present invention is a purified cDNA obtainable by a method 
of making a cDNA of the invention. In one aspect of this embodiment, said cDNA encodes at least 
a portion of a human polypeptide. 
25 Other protocols 

Alternatively, other procedures may be used for obtaining homologous cDNAs. In one 
approach, cDNAs are prepared from mRNA and cloned into double stranded phagemids as 
follows. The cDNA library in the double stranded phagemids is then rendered single stranded by 
treatment with an endonuclease, such as the Gene II product of the phage Fl and an exonuclease 

30 [Chang et al., (1993) Gene 127:95-8, which disclosure is hereby incorporated by reference in its 
entirety]. A biotinylated oligonucleotide comprising the sequence of a fragment of a known 
GENSET cDNA, genomic DNA or fragment thereof is hybridized to the single stranded 
phagemids. Preferably, the fragment comprises at least 10, 12, 15, 17, 18, 20, 23, 25, or 28 
consecutive nucleotides of a polynucleotide of the present invention. 

35 Hybrids between the biotinylated oligonucleotide and phagemids are isolated by 

incubating the hybrids with streptavidin coated paramagnetic beads and retrieving the beads with a 
magnet [Fry et aL, (1992) Biotechniques, 13: 124-131, which disclosure is hereby incorporated by 
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reference in its entirety]. Thereafter, the resulting phagemids are released from the beads and 
converted into double stranded DNA using a primer specific for the GENSET cDNA or fragment 
used to design the biotinylated oligonucleotide. Alternatively, protocols such as the Gene Trapper 
kit (Gibco BRL), which disclosure is which disclosure is hereby incorporated by reference in its 
5 entirety, may be used. The resulting double stranded DNA is transformed into bacteria. 

Homologous cDNAs to the GENSET cDNA or fragment thereof sequence are identified by colony 
PCR or colony hybridization. 
As a chromosome marker 

GENSET polynucleotides may be mapped to their chromosomal locations using any 
10 methods or techniques known to those skilled in the art including radiation hybrid (RH) mapping, 
PCR-based mapping and Fluorescence in situ hybridization (FISH) mapping described below. 
Radiation hybrid mapping 

Radiation hybrid (RH) mapping is a somatic cell genetic approach that can be used for 
high resolution mapping of the human genome. [See, e.g., Benham et al (1989) Genomics 4:509- 
15 517 and Cox et al, (1990) Science 250:245-250; and Schuler et al, (1996) Science 274:540-546], 
which disclosure is hereby incorporated by reference in its entirety.] 
Mapping ofcDNAs to Human Chromosomes using PCR techniques 

GENSET cDNAs and genomic DNAs may be assigned to human chromosomes using PCR 
based methodologies. ' In such approaches, oligonucleotide primer pairs are designed from the 
20 cDNA sequence to minimize the chance of amplifying through an intron. Preferably, the 
oligonucleotide primers are 18-23 bp in length and are designed for PCR amplification. The 
creation of PCR primers from known sequences is well known to those with skill in the art. For a 
review of PCR technology see Erlich (1992), which disclosure is hereby incorporated by reference 
■ in its entirety. 

25 PCR is used to screen a series of somatic cell hybrid cell lines containing defined sets of 

human chromosomes for the presence of a given cDNA or genomic DNA. DNA is isolated from 
the somatic hybrids and used as starting templates for PCR reactions using the primer pairs from 
the GENSET cDNAs or genomic DNAs. Only those somatic cell hybrids with chromosomes 
containing the human gene corresponding to the GENSET cDNA or genomic DNA will yield an 

30 amplified fragment. The GENSET cDNAs or genomic DNAs are assigned to a chromosome by 
analysis of the segregation pattern of PCR products from the somatic hybrid DNA templates. The 
single human chromosome present in all cell hybrids that give rise to an amplified fragment is the 
chromosome containing that GENSET cDNA or genomic DNA. For a review of techniques and 
analysis of results from somatic cell gene mapping experiments, see Ledbetter etai, (1990) 

35 Genomics 6:475-48 1 , which disclosure is hereby incorporated by reference in its entirety. 
Mapping of cDNAs to Chromosomes Using Fluorescence in situ Hybridization 

Fluorescence in situ hybridization (FISH) allows the GENSET cDNA or genomic DNA to 



76 



WO 02/094864 



PCT/IB01/01715 



be mapped to a particular location on a given chromosome. The chromosomes to be used for 
fluorescence in situ hybridization techniques may be obtained from a variety of sources including 
cell cultures, tissues, or whole blood. 

In a preferred embodiment, chromosomal localization of a GENSET cDNA or genomic 
5 DNA is obtained by FISH as described by Cherif et aL, (1990), "Simultaneous Localization of 
Cosmids and Chromosome R-Banding by Fluorescence Microscopy: Application to Regional 
Mapping of Human Chromosome 11", Proc. Natl. Acad. Sci. U.S.A., 87:6639-6643, which 
disclosure is hereby incorporated by reference in its entirety. For chromosomal localization, 
fluorescent R-bands are obtained as previously described (Cherif, et aL, 1990, supra). 

10 Use of cDNAs to Construct or Expand Chromosome Maps 

Once the GENSET cDNAs or genomic DNAs have been assigned to particular 
chromosomes using any technique known to those skilled in the art those skilled in the art, 
particularly those described herein, they may be utilized to construct a high resolution map of the 
chromosomes on which they are located or to identify the chromosomes in a sample. 

1 5 Chromosome mapping involves assigning a given unique sequence to a particular 

chromosome as described above. Once the unique sequence has been mapped to a given 
chromosome, it is ordered relative to other unique sequences located on the same chromosome. 
One approach to chromosome mapping utilizes a series of yeast artificial chromosomes (YACs) 
bearing several thousand long inserts derived from the chromosomes of the organism from which 

20 the GENSET cDNAs or genomic DNAs are obtained. This approach is described in Nagaraja et 
aL, (1997) "X chromosome map at 75-kb STS resolution, revealing extremes of recombination and 
GC content", Genome Res. 1997 Mar;7(3):210-22, which disclosure is hereby incorporated by 
reference in its entirety. 

Identification of genes associated with hereditary diseases or drug response 
25 This example illustrates an approach useful for the association of GENSET cDNAs or 

genomic DNAs with particular phenotypic characteristics. In this example, a particular GENSET 
cDNA or genomic DNA is used as a test probe to associate that GENSET cDNA or genomic DNA 
with a particular phenotypic characteristic. 

GENSET cDNAs or genomic DNAs are mapped to a particular location on a human 
30 chromosome using techniques such as those described herein or other techniques known in the art. 
A search of Mendelian Inheritance in Man (V. McKusick, Mendelian Inheritance in Man; available 
on line through Johns Hopkins University Welch Medical Library) reveals the region of the human 
chromosome which contains the GENSET cDNA or genomic DNA to be a very gene rich region 
containing several known genes and several diseases or phenotypes for which genes have not been 
35 identified. The gene corresponding to this GENSET cDNA or genomic DNA thus becomes an 
immediate candidate for each of these genetic diseases. 

Cells from patients with these diseases or phenotypes are isolated and expanded in culture. 
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PCR primers from the GENSET cDNA or genomic DNA are used to screen genomic DNA, 
mRNA or cDNA obtained from the patients. GENSET cDNAs or genomic DNAs that are not 
amplified in the patients can be positively associated with a particular disease by further analysis. 
Alternatively, the PCR analysis may yield fragments of different lengths when the samples are 
5 derived from an individual having the phenotype associated with the disease than when the sample 
is derived from a healthy individual, indicating that the gene containing the cDNA may be 
responsible for the genetic disease. 

Uses of polynucleotides in recombinant vectors 

10 The present invention also relates to recombinant vectors including the isolated 

polynucleotides of the present invention, and to host cells recombinant for a polynucleotide of the 
invention, such as the above vectors, as well as to methods of making such vectors and host cells 
and for using them for production of GENSET polypeptides by recombinant techniques. 
Recombinant Vectors 

1 5 The term 'Vector" is used herein to designate either a circular or a linear DNA or RNA 

molecule, which is either double-stranded or single-stranded, and which comprise at least one 
polynucleotide of interest that is sought to be transferred in a cell host or in a unicellular or 
multicellular host organism. The present invention encompasses a family of recombinant vectors 
that comprise a regulatory polynucleotide and/or a coding polynucleotide derived from either the 

20 GENSET genomic sequence or the cDNA sequence. Generally, a recombinant vector of the 
invention may comprise any of the polynucleotides described herein, including regulatory 
sequences, coding sequences and polynucleotide constructs, as well as any GENSET primer or 
probe as defined herein. 

In a first preferred embodiment, a recombinant vector of the invention is used to amplify 

25 the inserted polynucleotide derived from a GENSET genomic sequence or a GENSET cDNA, for 
example any cDNA selected from the group consisting of polynucleotide sequences of the 
Sequence Listing, those of human cDNA clone inserts of the deposited clone pool, variants and 
fragments thereof in a suitable cell host, this polynucleotide being amplified at every time that the 
recombinant vector replicates. 

30 A second preferred embodiment of the recombinant vectors according to the invention 

comprises expression vectors comprising either a regulatory polynucleotide or a coding nucleic 
acid of the invention, or both. Within certain embodiments, expression vectors are employed to 
express a GENSET polypeptide which can be then purified and, for example be used in ligand 
screening assays or as an immunogen in order to raise specific antibodies directed against the 

35 GENSET protein. In other embodiments, the expression vectors are used for constructing 
transgenic animals and also for gene therapy. Expression requires that appropriate signals are 
provided in the vectors, said signals including various regulatory elements, such as 
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enhancers/promoters from both viral and mammalian sources that drive expression of the genes of 
interest in host cells. Dominant drug selection markers for establishing permanent, stable cell 
clones expressing the products are generally included in the expression vectors of the invention, as 
they are elements that link expression of the drug selection markers to expression of the 
5 polypeptide. 

More particularly, the present invention relates to expression vectors which include nucleic 
acids encoding a GENSET protein, preferably a GENSET protein with an amino acid sequence 
selected from the group consisting of polypeptide sequences of the Sequence Listing, thoseencoded 
by the human cDNA clone inserts of the deposited clone pool, variants and fragments thereof. The 

10 polynucleotides of the present invention may be used to express an encoded protein in a host 
organism to produce a beneficial effect. In such procedures, the encoded protein may be 
transiently expressed in the host organism or stably expressed in the host organism. The encoded 
protein may have any of the activities described herein. The encoded protein may be a protein 
which the host organism lacks or, alternatively, the encoded protein may augment the existing 

1 5 levels of the protein in the host organism. 

Some of the elements which can be found in the vectors of the present invention are 
described in further detail in the following sections. 
General features of the expression vectors of the invention 

A recombinant vector according to the invention comprises, but is not limited to, a YAC 

20 (Y east Artificial Chromosome), a BAC (Bacterial Artificial Chromosome), a phage, a phagemid, a 
cosmid, a plasmid or even a linear DNA molecule which may comprise a chromosomal, non- 
chromosomal, semi-synthetic and synthetic DNA. Such a recombinant vector can comprise a 
transcriptional unit comprising an assembly of: 

(1) a genetic element or elements having a regulatory role in gene expression, for 
25 example promoters or enhancers. Enhancers are cis-acting elements of DNA, 

usually from about 10 to 300 bp in length that act on the promoter to increase the 
transcription. 

(2) a structural or coding sequence which is transcribed into mRNA and eventually 
translated into a polypeptide, said structural or coding sequence being operably 

30 linked to the regulatory elements described in (1); and 

(3) appropriate transcription initiation and termination sequences. Structural units 
intended for use in yeast or eukaryotic expression systems preferably include a 
leader sequence enabling extracellular secretion of translated protein by a host 
cell. Alternatively, when a recombinant protein is expressed without a leader or 

35 transport sequence, it may include a N-terminal residue. This residue may or may 

not be subsequently cleaved from the expressed recombinant protein to provide a 
final product. 
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Generally, recombinant expression vectors will include origins of replication, selectable 
markers permitting transformation of the host cell, and a promoter derived from a highly expressed 
gene to direct transcription of a downstream structural sequence. The heterologous structural 
sequence is assembled in appropriate phase with translation initiation and termination sequences, 
5 and preferably a leader sequence capable of directing secretion of the translated protein into the 
periplasmic space or the extracellular medium. In a specific embodiment wherein the vector is 
adapted for transfecting and expressing desired sequences in mammalian host cells, preferred 
vectors will comprise an origin of replication in the desired host, a suitable promoter and enhancer, 
and also any necessary ribosome binding sites, polyadenylation signals, splice donor and acceptor 
10 sites, transcriptional termination sequences, and 5 5 -flanking non-transcribed sequences. DNA 
sequences derived from the SV40 viral genome, for example SV40 origin, early promoter, 
enhancer, splice and polyadenylation signals may be used to provide the required non-transcribed 
genetic elements. 

The in vivo expression of a GENSET polypeptide of the present invention may be useful in 
1 5 order to correct a genetic defect related to the expression of the native gene in a host organism, for 
the treatment or prevention of any disease or condition that can be treated or prevented by 
increasing the level of GENSET polypeptide expression, or to the production of a biologically 
inactive GENSET protein. Consequently, the present invention also comprises recombinant 
expression vectors mainly designed for the in vivo production of a GENSET polypeptide the 
20 present invention by the introduction of the appropriate genetic material in the organism or the 
patient to be treated. This genetic material may be introduced in vitro in a cell that has been 
previously extracted from the organism, the modified cell being subsequently reintroduced in the 
said organism, directly in vivo into the appropriate tissue. 
Regulatory Elements 

25 The suitable promoter regions used in the expression vectors according to the present 

invention are chosen taking into account the cell host in which the heterologous gene has to be 
expressed. 

A suitable promoter may be heterologous with respect to the nucleic acid for which it 
controls the expression or alternatively can be endogenous to the native polynucleotide containing 

30 the coding sequence to be expressed. Additionally, the promoter is generally heterologous with 
respect to the recombinant vector sequences within which the construct promoter/coding sequence 
has been inserted. Promoter regions can be selected from any desired gene using, for example, 
CAT (chloramphenicol transferase) vectors and more preferably pKK232-8 and pCM7 vectors. 
Preferred bacterial promoters are the Lad, LacZ, the T3 or T7 bacteriophage RNA 

35 polymerase promoters, the gpt, lambda PR, PL and trp promoters (EP 0036776), the polyhedrin 
promoter, or the plO protein promoter from baculovirus (Kit Novagen) [Smith et al, (1983) Mol. 
Cell. Biol. 3:2156-2165; O'Reilly et al (1992), "Baculovirus Expression Vectors: A Laboratory 
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Manual", W. H. Freeman and Co., New York; which disclosures are hereby incorporated by 
reference in their entireties], the lambda PR promoter or also the trc promoter. 

Eukaryotic promoters include CMV immediate early, HSV thymidine kinase, early and 
late SV40, LTRs from retrovirus, and mouse metallothionein-L. Selection of a convenient vector 
5 and promoter is well within the level of ordinary skill in the art. 
Other regulatory elements 

Where a cDNA insert is employed, one will typically desire to include a polyadenylation 
signal to effect proper polyadenylation of the gene transcript. Also contemplated as an element of 
the expression cassette is a terminator. These elements can serve to enhance message levels and to 
1 0 minimize read through from the cassette into other sequences. 
Selectable Markers 

Selectable markers confer an identifiable change to the cell permitting easy identification 
of cells containing the expression construct. The selectable marker genes for selection of 
transformed host cells are preferably dihydrofolate reductase or neomycin resistance for eukaryotic 
15 cell culture, TRP1 for S. cerevisiae or tetracycline, rifampicin or ampicillin resistance in E. Coli, or 
levan saccharase for mycobacteria, this latter marker being a negative selection marker. 
Preferred Vectors 

Bacterial vectors 

As a representative but non-limiting example, useful expression vectors for bacterial use 
20 can comprise a selectable marker and a bacterial origin of replication derived from commercially 
available plasmids comprising genetic elements of pBR322 (ATCC 37017). Such commercial 
vectors include, for example, pKK223-3 (Pharmacia, Uppsala, Sweden), and pGEMl (Promega 
Biotec, Madison, WI, USA). Large numbers of other suitable vectors are known to those of skill in 
the art, and commercially available, such as the following bacterial vectors: pQE70, pQE60, pQE-9 
25 (Qiagen), pbs, pDIO, phagescript, psiX174, pbluescript SK, pbsks, pNH8A, pNH16A, pNH18A, 
pNH46A (Stratagene); ptrc99a, pKK223-3, pKK233-3, pDR540, pRTT5 (Pharmacia); pWLNEO, 
pSV2CAT, pOG44, pXTl, pSG (Stratagene); pSVK3, pBPV, pMSG, pSVL (Pharmacia); pQE-30 
(QIAexpress). 

Bacteriophage vectors 

30 The PI bacteriophage vector may contain large inserts ranging from about 80 to about 100 

kb. The construction of PI bacteriophage vectors such as pl58 or pl58/neo8 are notably described 
by Sternberg (1992) Trends Genet. 8:1-16, and Sternberg (1994) Mamm. Genome. 5:397-404, 
which disclosure is hereby incorporated by reference in its entirety. Recombinant PI clones 
comprising GENSET nucleotide sequences may be designed for inserting large polynucleotides of 

35 more than 40 kb [see, Linton et aL, (1993) J. Clin. Invest 92:3029-3037], which disclosure is 
hereby incorporated by reference in its entirety. To generate PI DNA for transgenic experiments, 
a preferred protocol is the protocol described by McCormick et aL, (1994) Genet. Anal. Tech. 
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Appl. 1 1:158-164, which disclosure is hereby incorporated by reference in its entirety. Briefly, E. 
coli (preferably strain NS3529) harboring the PI plasmid are grown overnight in a suitable broth 
medium containing 25 ng/ml of kanamycin. The PI DNA is prepared from the E. coli by alkaline 
lysis using the Qiagen Plasmid Maxi kit (Qiagen, Chatsworth, CA, USA), according to the 
5 manufacturer's instructions. The PI DNA is purified from the bacterial lysate on two Qiagen-tip 
500 columns, using the washing and elution buffers contained in the kit. A phenol/chloroform 
extraction is then performed before precipitating the DNA with 70% ethanol. After solubilizing 
the DNA in TE (10 mM Tris-HCl, pH 7.4, 1 mM EDTA), the concentration of the DNA is 
assessed by spectrophotometry. 

10 When the goal is to express a PI clone comprising GENSET polypeptide-encoding 

nucleotide sequences in a transgenic animal, typically in transgenic mice, it is desirable to remove 
vector sequences from the PI DNA fragment, for example by cleaving the PI DNA at rare-cutting 
sites within the PI polylinker (Sfil 9 Noil or Sail). The PI insert is then purified from vector 
sequences on a pulsed-field agarose gel, using methods similar to those originally reported for the 

15 isolation of DNA from YACs [see, e. g., Schedl et aL, (1993a), Nature, 362: 258-261; Peterson et 
ai, (1993), Proc. Natl. Acad. Sci. USA, 90 :7593-7597], which disclosures are hereby incorporated 
by reference in their entireties. At this stage, the resulting purified insert DNA can be 
concentrated, if necessary, on a Millipore Ultrafree-MC Filter Unit (Millipore, Bedford, MA, USA 
- 30,000 molecular weight limit) and then dialyzed against microinjection buffer (10 mM Tris- 

20 HC1, pH 7.4; 250 \iM EDTA) containing 100 mM NaCl, 30 \iM spermine, 70 \M spermidine on a 
microdyalisis membrane (type VS, 0.025 jiM from Millipore). The intactness of the purified PI 
DNA insert is assessed by electrophoresis on 1% agarose (Sea Kem GTG; FMC Bio-products) 
pulse-field gel and staining with ethidium bromide. 
Viral vectors 

25 In one specific embodiment, the vector is derived from an adenovirus. Preferred 

adenovirus vectors according to the invention are those described by Feldman and Steg, (1996), 
Medecine/Sciences, 12:47-55, or Ohno etai, (1994) Science. 265:781-784, which disclosures are 
hereby incorporated by reference in their entireties. Another preferred recombinant adenovirus 
according to this specific embodiment of the present invention is the human adenovirus type 2 or 5 

30 (Ad 2 or Ad 5) or an adenovirus of animal origin (French patent application No. FR-93 .05954, 
which disclosure is hereby incorporated by reference in its entirety). Further included in the 
present invention are ademo-associated virus vectors. 

Retrovirus vectors and adeno-associated virus vectors are generally understood to be the 
recombinant gene delivery systems of choice for the transfer of exogenous polynucleotides in vivo, 

35 particularly to mammals, including humans. Particularly preferred retroviruses for the preparation 
or construction of retroviral in vitro or in vitro gene delivery vehicles of the present invention 
include retroviruses selected from the group consisting of Mink-Cell Focus Inducing Virus, Murine 
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Sarcoma Virus, Reticuloendotheliosis virus and Rous Sarcoma virus. Particularly preferred 
Murine Leukemia Viruses include the 4070A and the 1504A viruses, Abelson (ATCC No VR- 
999), Friend (ATCC No VR-245), Gross (ATCC No VR-590), Rauscher (ATCC No VR-998) and 
Moloney Murine Leukemia Virus (ATCC No VR-190; PCT Application No WO 94/24298). 
5 Particularly preferred Rous Sarcoma Viruses include Bryan high titer (ATCC Nos VR-334, VR- 
657, VR-726, VR-659 and VR-728). Other preferred retroviral vectors are those described in Roth 
et al 9 (1996) Nature Medicine, 2(9):985-991, PCT Application No WO 93/25234, PCT 
ApplicationNo WO 94/ 06920, Roux et al (1989), Proc. Natl. Acad. Sci. U.S.A. 86:9079-9083, 
Julane/a/. (1992), J. Gen. Virol. 73:3251-3255, andNeda et al (1991), J. Biol. Chem. 266:14143- 
10 14146, which disclosures are hereby incorporated by reference in their entireties. 
BAC vectors 

The bacterial artificial chromosome (BAC) cloning system [Shizuya et al (1992), Proc. 
Natl. Acad. Sci. U.S.A. 89:8794-8797], which disclosure is hereby incorporated by reference in its 
entirety, has been developed to stably maintain large fragments of genomic DNA (100-300 kb) in 

15 E. coll A preferred BAC vector comprises a pBeloBACl 1 vector that has been described by Kim 
U-J. et al (1996), Genomics 34:213-218, which disclosure is hereby incorporated by reference in 
its entirety. BAC libraries are prepared with this vector using size-selected genomic DNA that has 
been partially digested using enzymes that permit ligation into either the Bam HI or HinaTK sites in 
the vector. Flanking these cloning sites are T7 and SP6 RNA polymerase transcription initiation 

20 . sites that can be used to generate end probes by either RNA transcription or PCR methods. After 
the construction of a BAC library in E. coli, BAC DNA is purified from the host cell as a 
supercoiled circle. Converting these circular molecules into a linear form precedes both size 
determination and introduction of the BACs into recipient cells. The cloning site is flanked by two 
Not I sites, permitting cloned segments to be excised from the vector by Not I digestion. 

25 Alternatively, the DNA insert contained in the pBeloBACl 1 vector may be linearized by treatment 
of the BAC vector with the commercially available enzyme lambda terminase that leads to the 
cleavage at the unique cosN site, but this cleavage method results in a full length BAC clone 
containing both the insert DNA and the BAC sequences. 
Baculovirus 

30 Another specific suitable host vector system is the pVL1392/1393 baculovirus transfer 

-vector (Pharmingen) that is used to transfect the SF9 cell line (ATCC No. CRL 171 1) which is 
derived from Spodoptera frugiperda. Other suitable vectors for the expression of the GENSET 
polypeptide of the present invention in a baculovirus expression system include those described by 
Chai et al. (1993), Biotechnol. Appl. Biochem. 18:259-273; Vlasak, et al (1983), Eur. J. Biochem. 

35 135:123-126, and Lenhard et al, (1996) Gene. 169:187-190, which disclosures are hereby 
incorporated by reference in their entireties. 
Delivery Of The Recombinant Vectors 
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To effect expression of the polynucleotides and polynucleotide constructs of the invention, 
the constructs must be delivered into a cell. This delivery may be accomplished in vitro, as in 
laboratory procedures for transforming cell lines, or in vivo or ex vivo, as in the treatment of certain 
diseases states. 

5 One mechanism is viral infection where the expression construct is encapsulated in an 

infectious viral particle. The expression construct, preferably a recombinant viral vector as 
discussed herein, may transduce packaging cells through any means known in the art such as 
electroporation, liposomes, and CaP04 precipitation. The packaging cell generates infectious viral 
particles that include a polynucleotide encoding a polypeptide of the present invention. Such viral 

10 particles then may be employed to transduce eukaryotic cells in vitro, ex vivo or in vivo. The 
transduced eukaryotic cells will express a polypeptide of the present invention. Preferably, the 
viruses used in the present invention are rendered replication deficient by deletion of one or more 
of all or a portion of the following genes: Ela, Elb, E3, E4, E2a, or LI through L5 (U.S. Patent 
6,228,844, which disclosure is hereby incorporated by reference in its entirety). Viral delivery is 

15 discussed in more detail herein (see also, U.S. Patent 5,968,821, which disclosure is hereby 
incorporated by reference in its entirety). 

Retrovirus vectors and adeno-associated virus vectors are generally understood to be the 
recombinant gene delivery system of choice for the transfer of exogenous genes in vivo, 
particularly into humans. These vectors provide efficient delivery of genes into cells, and the 

20 transferred nucleic acids are stably integrated into the chromosomal DNA of the host. A major 
prerequisite for the use of retroviruses is to ensure the safety of their use, particularly with regard 
to the possibility of the spread of wild-type virus in the cell population. The development of 
specialized cell lines (termed "packaging cells") which produce only replication-defective 
retroviruses has increased the utility of retroviruses for gene therapy, and defective retroviruses are 

25 well characterized for use in gene transfer for gene therapy purposes (for a review see Miller, A. D. 
(1990) Blood 76:271). Thus, recombinant retrovirus can be constructed in which part of the 
retroviral coding sequence (gag, pol, env) has been replaced by nucleic acid encoding one of the 
subject CCR-proteins, rendering the retrovirus replication defective. The replication defective 
retrovirus is then packaged into virions which can be used to infect a target cell through the use of 

30 a helper virus by standard techniques. 

Protocols for producing recombinant retroviruses and for infecting cells in vitro or in vivo 
with such viruses can be found in Current Protocols in Molecular Biology, Ausubel, F. M. et al. 
(eds.) Greene Publishing Associates, (1989), Sections 9.10-9.14 and other standard laboratory 
manuals. Examples of suitable retroviruses include pLJ, pZIP, pWE and pEM which are well 

35 known to those skilled in the art. Examples of suitable packaging virus lines for preparing both 
ecotropic and amphotropic retroviral systems include .psi.Crip, .psi.Cre, .psi.2 and .psi.Am. 
Retroviruses have been used to introduce a variety of genes into many different cell types, 
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including neural cells, epithelial cells, endothelial cells, lymphocytes, myoblasts, hepatocytes, bone 
marrow cells, in vitro and/or in vivo (see for example Eglitis, et al. (1985) Science 230:1395-1398; 
Danos and Mulligan (1988) Proc. Natl. Acad. Sci. USA 85:6460-6464; Wilson, et al. (1988) Proc. 
Natl. Acad. Sci. USA 85:3014-3018; Armentano, et al. (1990) Proc. Natl. Acad. Sci. USA 
5 87:6141-6145; Huber, et al. (1991) Proc. Natl. Acad. Sci. USA 88:8039-8043; FeiTy, et al. (1991) 
Proc. Natl. Acad. Sci. USA 88:8377-8381; Chowdhury, et al. (1991) Science 254:1802-1805; van 
Beusechem, et al. (1992) Proc. Natl. Acad. Sci. USA 89:7640-7644; Kay, et al. (1992) Human 
Gene Therapy 3:641-647; Dai, et al. (1992) Proc. Natl. Acad. Sci. USA 89:10892-10895; Hwu, et 
al. (1993) J. Immunol. 150:4104-4115; U.S. Pat. No. 4,868,116; U.S. Pat. No. 4,980,286; PCT 

10 Application WO 89/07136; PCT Application WO 89/02468; PCT Application WO 89/05345; and 
PCT Application WO 92/07573). 

Furthermore, it has been shown that it is possible to limit the infection spectrum of 
retroviruses and consequently of retroviral-based vectors, by modifying the viral packaging 
proteins on the surface of the viral particle (see, for example PCT publications W093/25234, 

15 WO94/06920, and W094/1 1524). For instance, strategies for the modification of the infection 
spectrum of retroviral vectors include: coupling antibodies specific for cell surface antigens to the 
viral env protein (Roux, et al. (1989) PNAS 86:9079-9083; Man, et al. (1992) J. Gen Virol 
73:3251-3255; and Goud, et al. (1983) Virology 163:251-254); or coupling cell surface ligands to 
the viral env proteins (Neda, et al. (1991) J Biol Chem 266: 14143-14146). Coupling can be in the 

20 form of the chemical cross-Unking with a protein or other variety (e.g. lactose to convert the env 
protein to an asialoglycoprotein), as well as by generating fusion proteins (e.g. single-chain 
antibody/ env fusion proteins). This technique, while useful to limit or otherwise direct the 
infection to certain tissue types, and can also be used to convert an ecotropic vector in to an 
amphotropic vector. 

25 Moreover, use of retroviral gene delivery can be further enhanced by the use of tissue- or 

cell-specific transcriptional regulatory sequences that control expression of the desired gene. 

Another viral gene delivery system useful in the present invention utilitizes adenovirus- 
derived vectors. The genome of an adenovirus can be manipulated such that it encodes a gene 
product of interest, but is inactivate in terms of its ability to replicate in a normal lytic viral life 

30 cycle (see, for example, Berkner, et al. (1988) BioTechniques 6:616; Rosenfeld, et al. (1991) 
Science 252:431-434; and Rosenfeld, et al. (1992) Cell 68:143-155). Suitable adenoviral vectors 
derived from the adenovirus strain Ad type 5 dl324 or other strains of adenovirus (e.g., Ad2 5 Ad3, 
Ad7 etc.) are well known to those skilled in the art. Recombinant adenoviruses can be 
advantageous in certain circumstances in that they are not capable of infecting nondividing cells 

35 and can be used to infect a wide variety of cell types, including airway epithelium (Rosenfeld, et 
al. (1992) cited supra), endothelial cells (Lemarchand et al.(1992) Proc. Natl. Sci. USA 89:6482- 
6486), hepatocytes (Herz and Gerard (1993) Proc. Natl. Acad. Sci. USA 90:2812-2816) and 
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muscle cells (Quantin, et al. (1992) Proc. Natl. Acad. Sci. USA 89:2581-2584). Furthermore, the 
virus particle is relatively stable and amenable to purification and concentration, and as above, can 
be modified so as to affect the spectrum of infectivity. Additionally, introduced adenoviral 
polynucleotides (and foreign polynucleotides contained therein) is not integrated into the genome 
5 of a host cell but remains episomal, thereby avoiding potential problems that can occur as a result 
of insertional mutagenesis in situations where introduced DNA becomes integrated into the host 
genome (e.g., retroviral DNA). Moreover, the carrying capacity of the adenoviral genome for 
foreign DNA is large (up to 8 kilobases) relative to other gene delivery vectors (Haj-Ahmand and 
Graham (1986) J. Virol. 57:267). Most replication-defective adenoviral vectors currently in use 

1 0 and therefore favored by the present invention are deleted for all or parts of the viral El and E3 
genes but retain as much as 80% of the adenoviral genetic material (see, e.g., Jones, et al. (1979) 
Cell 16:683; Berkner, et al., supra; and Graham, et al. in Methods in Molecular Biology, E. J. 
Murray, Ed. (Humana, Clifton, N.J., 1991) vol. 7. pp.109-127). Expression of desired 
polynucleotides can be under control of, for example, the El A promoter, the major late promoter 

15 (MLP) and associated leader sequences, the E3 promoter, or exogenously added promoter 
sequences. 

Yet another viral vector system useful for delivery of polynucleotides is the adeno- 
associated virus (AAV). Adeno-associated virus is a naturally occurring defective virus that 
requires another virus, such as an adenovirus or a herpes virus, as a helper virus for efficient 

20 replication and a productive life cycle. (For a review see Muzyczka, et al., Curr. Topics in Micro, 
and Immunol. (1992) 158:97-129). It is also one of the few viruses that may integrate its nucleic 
acids into non-dividing cells, and exhibits a high frequency of stable integration (see for example 
Flotte et al y (1992) Am. J. Respir. Cell Mol. Biol. 7:349-356; Am. J. Respir. Cell. Mol. Biol. 
7:349-356; Samulski et al. (1989) J. Virol. 63:3822-3828; and McLaughlin et al. (1989) J. Virol. 

25 62:1963-1973). Vectors containing as little as 300 base pairs of AAV can be packaged and can 
integrate. Space for exogenous DNA is limited to about 4.5 kb. An AAV vector such as that 
described in Tratschin, et al. (1985) Mol. Cell. Biol. 5:3251-3260 can be used to introduce DNA 
into cells. A variety of nucleic acids have been introduced into different cell types using AAV 
vectors (see for example Hermonat, et al. (1984) Proc. Natl. Acad. Sci. USA 81:6466r6470; 

30 Tratschin, et al. (1985) Mol. Cell. Biol. 4:2072-2081; Wondisford, et al. (1988) Mol. Endocrinol. 
2:32-39; Tratschin, et al. (1984) J. Virol. 51:61 1-619; and Flotte, et al. (1993) J. Biol. Chem. 
268:3781-3790). 

Other viral vector systems that may have application in gene therapy have been derived 
from herpes virus, vaccinia virus, and several RNA viruses. In particular, herpes virus vectors may 
35 provide a unique strategy for persistence of inserted gene expression in cells of the central nervous 
system and ocular tissue (Pepose, et al. (1994) Invest Ophthalmol Vis Sci 35:2662-2666). 

Several non-viral methods for the transfer of polynucleotides into cultured mammalian 
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cells are also contemplated by the present invention, and include, without being limited to, calcium 
phosphate precipitation [Graham et al, (1973) ViroL 52:456-457; Chen et al. (1987) Mol. Cell. 
Biol. 7:2745-2752]; DEAE-dextran [Gopal (1985) Mol. Cell. Biol., 5:1188-1190]; electroporation 
[Tur-Kaspa et al (1986) Mol. Cell. Biol. 6:716-718; Potter et al, (1984) Proc. Natl. Acad. Sci. 

5 U.S.A. 81(22):7161-7165]; direct microinjection (Harlandef a/., (1985) J. Cell. Biol. 101:1094- 
1095); DNA-loaded liposomes [Nicolau et al, (1982) Biochim. Biophys. Acta. 721:185-190; 
Fraley et al, (1979) Proc. Natl. Acad. Sci. USA. 76:3348-3352]; and receptor-mediated 
transfection. [Wu and Wu (1987), J. Biol. Chem. 262:4429-4432; and Wu and Wu (1988), 
Biochemistry 27:887-892], which disclosures are hereby incorporated by reference in their 

10 entireties. Some of these techniques may be successfully adapted for in vivo or ex vivo use, as 
discussed herein. 

Once the expression polynucleotide has been delivered into the cell, it may be stably 
integrated into the genome of the recipient cell. This integration may be in the cognate location 
and orientation via homologous recombination (gene replacement) or it may be integrated in a 

15 random, non-specific location (gene augmentation). In yet further embodiments, the nucleic acid 
may be stably maintained in the cell as a separate, episomal segment of DNA. Such nucleic acid 
segments or "episomes" encode sequences sufficient to permit maintenance and replication 
independent of or in synchronization with the host cell cycle. 

One specific embodiment for a method for delivering a protein or peptide to the interior of 

20 a cell of a vertebrate in vivo comprises the step of introducing a preparation comprising a 
physiologically acceptable carrier and a naked polynucleotide operatively coding for the 
polypeptide of interest into the interstitial space of a tissue comprising the cell, whereby the naked 
polynucleotide is taken up into the interior of the cell and has a physiological effect. This is 
particularly applicable for transfer in vitro but it may be applied to in vivo as well. 

25 Compositions for use in vitro and in vivo comprising a "naked" polynucleotide are 

described in PCT application No. WO 90/1 1092 (Vical Inc.) and also in PCT application No. WO 
95/1 1307 (Institut Pasteur, INSERM, Universite d'Ottawa) as well as in the articles of Tascon et 
al (1996), Nature Medicine. 2(8):888-892 and of Huygen et al, (1996) Nature Medicine. 
2(8):893-898, which disclosures are hereby incorporated by reference in their entireties. 

30 In still another embodiment of the invention, the transfer of a naked polynucleotide of the 

invention, including a polynucleotide construct of the invention, into cells may be accomplished 
with particle bombardment (biolistic), said particles being DNA-coated microprojectiles 
accelerated to a high velocity allowing them to pierce cell membranes and enter cells without 
killing them, such as described by Klein et al, (1987) Nature 327:70-73, which disclosure is 

35 hereby incorporated by reference in its entirety. Liposomal preparations for use in the present 
invention include canonic (positively charged), anionic (negatively charged) and neutral 
preparations. However, canonic liposomes are particularly preferred because a tight charge 
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complex can be formed between the cationic liposome and the polyanionic nucleic acid. Cationic 
liposomes have been shown to mediate intracellular delivery of plasmid DNA (Feigner, et al., 
Proc. Nat. Acad. Sci. USA (1987) 84:7413-7416, which is herein incorporated by reference); 
mRNA (Malone, et al., Proc. Natl. Acad. Sci. USA (1989) 86:6077-6081, which is herein 

5 incorporated by reference); and purified transcription factors (Debs et al., J. Biol. Chem. (1990) 
265:10189-10192, which is herein incorporated by reference), in functional form. 

Cationic liposomes are readily available. For example,Ntl-2,3-dioleyloxy)propyll-N,N,N- 
triethylammonium (DOTMA) liposomes are particularly useful and are available under the 
trademark Lipofectin, from GIBCO BRL, Grand Island,N.Y. (See, also, Feigner, et al., Proc. Nad 

10 Acad. Sci. USA (1987) 84:7413-7416, which is herein incorporated by reference). Other 
commercially available liposomes include transfectace (DDAB/DOPE) and DOTAP/DOPE 
(Boehringer). 

Similarly, anionic and neutral liposomes are readily available, such as from AvantiPolar 
Lipids (Birmingham, Ala.), or can be easily prepared using readily available materials. Such 

15 materials include phosphatidyl, choline, cholesterol, phosphatidyl ethanolamine, 

dioleoylphosphatidyl choline (DOPC), dioleoylphosphatidyl glycerol (DOPG), dioleoylphoshatidyl 
ethanolamine (DOPE), among others. These materials can also be mixed with the DOTMA and 
DOTAP starting materials in appropriate ratios. Methods for making liposomes using these 
materials are well known in the art. 

20 For example, commercially dioleoylphosphatidyl choline (DOPC), dioleoylphosphatidyl 

glycerol (DOPG), and dioleoylphosphatidyl ethanolamine (DOPE) can be used in various 
combinations to make conventional liposomes, with or without the addition of cholesterol. The 
liposomes can comprise multilamellar vesicles (MLVs), small unilamellar vesicles (SUVs), or 
large unilamellar vesicles (LUVs), with SUVs being preferred. The various liposome-nucleic acid 

25 complexes are prepared using methods well known in the art (Straubinger, et al., Methods of 
Immunology (1983), 101 :5 12-527, which is herein incorporated by reference). For example, 
. MLVs containing nucleic acid can be prepared by depositing a thin film of phospholipid on the 
walls of a glass tube and subsequently hydrating with a solution of the material to be encapsulated 
(U.S. Patent 5,965,421, which disclosure is hereby incorporated by reference). 

30 Generally, the ratio of DNA to liposomes will be from about 10: 1 to about 1: 10. Preferably, the 
ration will be from about 5:1 to about 1 :5. More preferably, the ratio will be about 3: 1 to about 1 : 
3. Still more preferably, the ratio will be about 1:1. Additionally, liposomes may be targeted to 
specific cell types by embedding a targeting moiety such as a member of a receptor- receptor 
ligand pair into the lipid envelope of the vesicle. Useful targeting moieties specifically bind cell 

35 surface ligands, for example, CD48 or the SCF receptor on mast cells. Thus, anti-CD48 antibodies 
or SCF ligand are examples of useful mast cell-targeting moieties (U.S. Patent 6177433, U.S. 
Patent 61 10490, and P.C.T No. WO9704748, which disclosures are hereby incorporated by 
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reference in their entireties). 

In a further embodiment, the polynucleotide of the invention may be entrapped in a 
liposome [Ghosh and Bacchawat, (1991), Targeting of liposomes to hepatocytes, IN: Liver 
Diseases, Targeted diagnosis and therapy using specific rceptors and ligands. Eds., Marcel 
5 Dekeker, New York, pp. 87-104; Wong, et al (1980), Gene. 10:87-94; Nicolau et aL, (1987), 
Meth. Enzymol., 149: 157-76, which disclosures are hereby incorporated by reference in their 
entireties]. 

In a specific embodiment, the invention provides a composition for the in vivo production 
of the GENSET polypeptides described herein. It comprises a naked polynucleotide operatively 

10 coding for this polypeptide, in solution in a physiologically acceptable carrier, and suitable for 
introduction into a tissue to cause cells of the tissue to express the said protein or polypeptide. 

The amount of vector to be injected to the desired host organism varies according to the 
site of injection. As an indicative dose, it will be injected between 0.1 and 100 ug of the vector in 
an animal body, preferably a mammal body, for example a mouse body. 

15 In another embodiment of the vector according to the invention, it may be introduced in 

vitro in a host cell, preferably in a host cell previously harvested from the animal to be treated and 
more preferably a somatic cell such as a muscle cell. In a subsequent step, the cell that has been 
transformed with the vector coding for the desired GENSET polypeptide or the desired fragment 
thereof is reintroduced into the animal body in order to deliver the recombinant protein within the 

20 body either locally or systemically. 
Secretion vectors 

Some of the GENSET cDNAs or genomic DNAs of the invention may also be used to 
construct secretion vectors capable of directing the secretion of the proteins encoded by genes 
inserted in the vectors. Such secretion vectors may facilitate the purification or enrichment of the 
25 proteins encoded by genes inserted therein by reducing the number of background proteins from 
which the desired protein must be purified or enriched. Exemplary secretion vectors are described 
below. 

The secretion vectors of the present invention include a promoter capable of directing gene 
expression in the host cell, tissue, or organism of interest. Such promoters include the Rous 

30 Sarcoma Virus promoter, the SV40 promoter, the human cytomegalovirus promoter, and other 
promoters familiar to those skilled in the art. 

A signal sequence from a polynucleotide of the invention and signal sequences of clone 
inserts of the deposited clone pool is operably linked to the promoter such that the mRNA 
transcribed from the promoter will direct the translation of the signal peptide. The host cell, tissue, 

35 or organism may be any cell, tissue, or organism which recognizes the signal peptide encoded by 
the signal sequence in the GENSET cDNA or genomic DNA. Suitable hosts include mammalian 
cells, tissues or organisms, avian cells, tissues, or organisms, insect cells, tissues or organisms, or 
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yeast. 

In addition, the secretion vector contains cloning sites for inserting genes encoding the 
proteins which are to be secreted. The cloning sites facilitate the cloning of the insert gene in 
frame with the signal sequence such that a fusion protein in which the signal peptide is fused to the 
5 protein encoded by the inserted gene is expressed from the mRNA transcribed from the promoter. 
The signal peptide directs the extracellular secretion of the fusion protein. 
The secretion vector may be DNA or RNA and may integrate into the chromosome of the host, be 
stably maintained as an extrachromosomal replicon in the host, be an artificial chromosome, or be 
transiently present in the host. Preferably, the secretion vector is maintained in multiple copies in 

10 each host cell. As used herein, multiple copies means at least 2, 5, 10, 20, 25, 50 or more than 50 
copies per cell. In some embodiments, the multiple copies are maintained extrachromosomally. In 
other embodiments, the multiple copies result from amplification of a chromosomal sequence. 

Many nucleic acid backbones suitable for use as secretion vectors are known to those 
skilled in the art, including retroviral vectors, SV40 vectors, Bovine Papilloma Virus vectors, yeast 

15 integrating plasmids, yeast episomal plasmids, yeast artificial chromosomes, human artificial 
chromosomes, P element vectors, baculovirus vectors, or bacterial plasmids capable of being 
transiently introduced into the host. 

The secretion vector may also contain a polyA signal such that the polyA signal is located 
downstream of the gene inserted into the secretion vector. 

20 After the gene encoding the protein for which secretion is desired is inserted into the 

secretion vector, the secretion vector is introduced into the host cell, tissue, or organism using 
calcium phosphate precipitation, DEAE-Dextran, electroporation, liposome-mediated transfection, 
viral particles or as naked DNA. The protein encoded by the inserted gene is then purified or 
enriched from the supernatant using conventional techniques such as ammonium sulfate 

25 precipitation, immunoprecipitation, immunochromatography, size exclusion chromatography, ion 
exchange chromatography, and hplc. Alternatively, the secreted protein may be in a sufficiently 
enriched or pure state in the supernatant or growth media of the host to permit it to be used for its 
intended purpose without further enrichment. 

The signal sequences may also be inserted into vectors designed for gene therapy. In such 

30 vectors, the signal sequence is operably linked to a promoter such that mRNA transcribed from the 
promoter encodes the signal peptide. A cloning site is located downstream of the signal sequence 
such that a gene encoding a protein whose secretion is desired may readily be inserted into the 
vector and fused to the signal sequence. The vector is introduced into an appropriate host cell. 
The protein expressed from the promoter is secreted extracellularly, thereby producing a 

35 therapeutic effect. 
Cell Hosts 

Another object of the invention comprises a host cell that has been transformed or 
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transfected with one of the polynucleotides described herein, and in particular a polynucleotide 
either comprising a GENSET polypeptide-encoding polynucleotide regulatory sequence or the 
polynucleotide coding for a GENSET polypeptide. Also included are host cells that are 
transformed (prokaryotic cells), transfected (eukaryotic cells), or transduced with a recombinant 
5 vector such as one of those described above. However, the cell hosts of the present invention can 
comprise any of the polynucleotides of the present invention. Preferred host cells used as 
recipients for the expression vectors of the invention are the following: 

a) Prokaryotic host cells: Escherichia coli strains (I.E.DH5-a strain), Bacillus 
subtilis, Salmonella typhimurium, and strains from species like Pseudomonas, 

1 0 Streptomyces and Staphylococcus, 

b) Eukaryotic host cells: HeLa cells (ATCC No.CCL2; No.CCL2. 1; No.CCL2.2), Cv 
1 cells (ATCC No.CCL70), COS cells (ATCC No.CRL1650; No.CRL1651), Sf-9 
cells (ATCC No.CRL1711), C127 cells (ATCC No. CRL-1804), 3T3 (ATCC No. 
CRL-6361), CHO (ATCC No. CCL-61), human kidney 293. (ATCC No. 45504; 

15 No. CRL-1573) and BHK (ECACC No. 84100501; No. 841 11301). 

The present invention also encompasses primary, secondary, and immortalized 
homologously recombinant host cells of vertebrate origin, preferably mammalian origin and 
particularly human origin, that have been engineered to: a) insert exogenous (heterologous) 
polynucleotides into the endogenous chromosomal DNA of a targeted gene, b) delete endogenous 

20 chromosomal DNA, and/or c) replace endogenous chromosomal DNA with exogenous 

polynucleotides. Insertions, deletions, and/or replacements of polynucleotide sequences may be to 
the coding sequences of the targeted gene and/or to regulatory regions, such as promoter and 
enhancer sequences, operably associated with the targeted gene. 

In addition to encompassing host cells containing the vector constructs discussed herein, 

25 the invention also encompasses primary, secondary, and immortalized host cells of vertebrate 
origin, particularly mammalian origin, that have been engineered to delete or replace endogenous 
genetic material (e.g., coding sequence), and/or to include genetic material (e.g., heterologous 
polynucleotide sequences) that is operably associated with the polynucleotides of the invention, 
and which activates, alters, and/or amplifies endogenous polynucleotides. For example, techniques 

30 known in the art may be used to operably associate heterologous control regions (e.g., promoter 
and/qr enhancer) and endogenous polynucleotide sequences via homologous recombination, see, 
e.g., U.S. Patent No. 5,641,670, issued June 24, 1997; International Publication No. WO 96/2941 1, 
published September 26, 1996; International Publication No. WO 94/12650, published August 4, 
1994; Koller, et a!., (1989); and Zijlstra, et al (1989) (the disclosures of each of which are 

35 incorporated by reference in their entireties). 

The present invention further relates to a method of making a homologously recombinant 
host cell in vitro or in vivo, wherein the expression of a targeted gene not normally expressed in the 
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cell is altered. Preferably the alteration causes expression of the targeted gene under normal 
growth conditions or under conditions suitable for producing the polypeptide encoded by the 
targeted gene. The method comprises the steps of: (a) transfecting the cell in vitro or in vivo with 
a polynucleotide construct, said polynucleotide construct comprising; (i) a targeting sequence; 
5 (ii) a regulatory sequence and/or a coding sequence; and (iii) an unpaired splice donor site, if 
necessary, thereby producing a transfected cell; and (b) maintaining the transfected cell in vitro or 
in vivo under conditions appropriate for homologous recombination. 

The present invention further relates to a method of altering the expression of a targeted 
gene in a cell in vitro or in vivo wherein the gene is not normally expressed in the cell, comprising 

10 the steps of: (a) transfecting the cell in vitro or in vivo with a polynucleotide construct, said 
polynucleotide construct comprising: (i) a targeting sequence; (ii) a regulatory sequence and/or a 
coding sequence; and (iii) an unpaired splice donor site, if necessary, thereby producing a 
transfected cell; and (b) maintaining the transfected cell in vitro or in vivo under conditions 
appropriate for homologous recombination, thereby producing a homologously recombinant cell; 

1 5 and (c) maintaining the homologously recombinant cell in vitro or in vivo under conditions 
appropriate for expression of the gene. 

The present invention further relates to a method of making a polypeptide of the present 
invention by altering the expression of a targeted endogenous gene in a cell in vitro or in vivo 
wherein the gene is not normally expressed in the cell, comprising the steps of: a) transfecting the 

20 cell in vitro with a polynucleotide construct, said polynucleotide construct comprising: (i) a 
targeting sequence; (ii) a regulatory sequence and/or a coding sequence; and (iii) an unpaired 
splice donor site, if necessary, thereby producing a transfected cell; (b) maintaining the transfected 
cell in vitro or in vivo under conditions appropriate for homologous recombination, thereby 
producing a homologously recombinant cell; and c) maintaining the homologously recombinant 

25 cell in vitro or in vivo under conditions appropriate for expression of the gene thereby making the 
polypeptide. 

The present invention further relates to a polynucleotide construct which alters the 
expression of a targeted gene in a cell type in which the gene is not normally expressed. This 
occurs when the polynucleotide construct is inserted into the chromosomal DNA of the target cell, 

30 wherein said polynucleotide construct comprises: a) a targeting sequence; b) a regulatory sequence 
and/or coding sequence; and c) an unpaired splice-donor site, if necessary. Further included are a 
polynucleotide construct, as described above, wherein said polynucleotide construct further 
comprises a polynucleotide which encodes a polypeptide and is in-frame with the targeted 
endogenous gene after homologous recombination with chromosomal DNA. 

35 The compositions may be produced, and methods performed, by techniques known in the 

art, such as those described in U.S. Patent NOs: 6,054,288; 6,048,729; 6,048,724; 6,048,524; 
5,994,127; 5,968,502; 5,965,125; 5,869,239; 5,817,789; 5,783,385; 5,733,761; 5,641,670; 
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5,580,734; International Publication NOs: W096/29411, WO 94/12650; and scientific articles 
described by Koller, et aL, (1994). (The disclosures of each of which are incorporated by 
reference in their entireties.) 

GENSET gene expression in mammalian cells, preferably human cells, may be rendered 

5 defective, or alternatively may be altered by replacing endogenous GENSET polypeptide-encoding 
genes in the genome of an animal cell by a GENSET polypeptide-encoding polynucleotide 
according to the invention. These genetic alterations may be generated by homologous 
recombination using previously described specific polynucleotide constructs. 

Mammal zygotes, such as murine zygotes may be used as cell hosts. For example, murine 

1 0 zygotes may undergo microinjection with a purified DNA molecule of interest. 

Any one of the polynucleotides of the invention, including the Polynucleotide constructs 
described herein, may be introduced in an embryonic stem (ES) cell line, preferably a mouse ES 
cell line. ES cell lines are derived from pluripotent, uncommitted cells of the inner cell mass of 
pre-implantation blastocysts. Preferred ES cell lines are the following: ES-E14TG2a (ATCC 

15 No.CRL-1821), ES-D3 (ATCCNo.CRL1934 and No. CRL-11632), YS001 (ATCC No. CRL- 
1 1776), 36.5 (ATCC No. CRL-1 1 1 16). ES cells are maintained in an uncommitted state by culture 
in the presence of growth-inhibited feeder cells which provide the appropriate signals to preserve 
this embryonic phenotype and serve as a matrix for ES cell adherence. Preferred feeder cells are 
primary embryonic fibroblasts that are established from tissue of day 13- day 14 embryos of 

20 virtually any mouse strain, that are maintained in culture, such as described by Abbondanzo et al. 9 
(1993), Meth. EnzymoL, Academic Press, New York, pp 803-823 and are growth-inhibited by 
irradiation, such as described by Robertson, (1987), Embryo-derived stem cell lines; In: EJ. 
Robertson Ed. Teratocarcinomas and embrionic stem cells: a practical approach. IRL Press, 
Oxford, pp. 71, or by the presence of an inhibitory concentration of LIF, such as described by 

25 Pease and William, (1990), Exp. Cell. Res. 190: 209-21 1, which disclosures are hereby 
incorporated by reference in their entireties. 

The constructs in the host cells can be used in a conventional manner to produce the gene 
product encoded by the recombinant sequence. 
Transgenic Animals 

30 The terms 'transgenic animals" or "host animals" are used herein to designate animals that 

have their genome genetically and artificially manipulated so as to include one of the nucleic acids 
according to the invention. The cells affected may be somactic, germ cells, or both. Preferred 
animals are non-human mammals and include those belonging to a genus selected from Mus (e.g. 
mice), Rattus (e.g. rats) and Oryctogalus (e.g. rabbits) which have their genome artificially and 

35 genetically altered by the insertion of a nucleic acid according to the invention. In one 

embodiment, the invention encompasses non-human host mammals and animals comprising a 
recombinant vector of the invention or a GENSET gene disrupted by homologous recombination 
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with a knock out vector. 

The transgenic animals of the invention all include within a plurality of their cells a cloned 
recombinant or synthetic DNA sequence, more specifically one of the purified or isolated nucleic 
acids comprising a GENSET polypeptide coding sequence, a GENSET polynucleotide regulatory 
5 sequence, a polynucleotide construct, or a DNA sequence encoding an antisense polynucleotide 
such as described in the present specification. 

Generally, a transgenic animal according the present invention comprises any of the 
polynucleotides, the recombinant vectors and the cell hosts described in the present invention. In a 
first preferred embodiment, these transgenic animals may be good experimental models in order to 

10 study the diverse pathologies related to the dysregulation of the expression of a given GENSET 
gene, in particular the transgenic animals containing within their genome one or several copies of 
an inserted polynucleotide encoding a native GENSET polypeptide, or alternatively a mutant 
GENSET polypeptide. 

In a second preferred embodiment, these transgenic animals may express a desired 

1 5 polypeptide of interest under the control of the regulatory polynucleotides of the GENSET gene, 
leading to high yields in the synthesis of this protein of interest, and eventually to tissue specific 
expression of the protein of interest. 

The design of the transgenic animals of the invention may be made according to the 
conventional techniques well known from the one skilled in the art For more details regarding the 

20 production of transgenic animals, and specifically transgenic mice, it may be referred to US 

Patents Nos 4,873,191, issued Oct 10, 1989; 5,464,764 issued Nov 7, 1995; and 5,789,215, issued 
Aug 4, 1998; these documents being herein incorporated by reference to disclose methods 
producing transgenic mice. 

Transgenic animals of the present invention are produced by the application of procedures 

25 which result in an animal with a genome that has incorporated exogenous genetic material. The 
procedure involves obtaining the genetic material which encodes either a GENSET polypeptide 
coding sequence, a GENSET polynucleotide regulatory sequence, or a DNA sequence encoding a 
GENSET polynucleotide antisense sequence, or a portion thereof, such as described in the present 
specification. A recombinant polynucleotide of the invention is inserted into an embryonic or ES 

30 stem cell line. [See, e.g., Thomas, et al (1987) Cell. 51:503-512, which disclosure is hereby 

incorporated by reference in its entirety.] An illustrative positive-negative selection procedure that 
may be used according to the invention is described by Mansour et al. 9 (1988) Nature. 336:348- 
352, which disclosure is hereby incorporated by reference in its entirety. 

The positive cells are then isolated, cloned and injected into 3.5 days old blastocysts from 

35 mice, such as described by Bradley (1987) [Production and analysis of chimaeric mice In: E J. 
Robertson (Ed), Teratocarcinomas and embryonic stem cells: A practical approach. IRL Press, 
Oxford, pp.1 13)], which disclosure is hereby incorporated by reference in its entirety. The 
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blastocysts are then inserted into a female host animal and allowed to grow to term. Alternatively, 
the positive ES cells are brought into contact with embryos at the 2.5 days old 8-16 cell stage 
(morulae) such as described by Wood, et al (1993), Proc. Natl. Acad. Sci. USA, 90: 4582-4585, or 
by Nagy et al, (1993), Proc. Natl. Acad. Sci. USA 90: 8424-8428, which disclosures are hereby 
5 incorporated by reference in their entireties, the ES cells being internalized to colonize extensively 
the blastocyst including the cells which will give rise to the germ line. 

The offspring of the female host are tested to determine which animals are transgenic e.g. 
include the inserted exogenous DNA sequence and which ones are wild type. 

Thus, the present invention also concerns a transgenic animal containing a nucleic acid, a 

10 recombinant expression vector or a recombinant host cell according to the invention. 

In another embodiment, transgenic animals are produced by microinjecting 
polynucleotides ares microinjected into a fertilized oocyte. Methods for culturing fertilized 
oocytes to the pre-implantation stage are described, e.g., by Gordon, et al. ((1984) Methods in 
Enzymology, 101, 414); Hogan, et al. [(1986) in Manipulating the mouse embryo, A Laboratory 

15 Manual. Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y (for the mouse embryo)] ; 
Hammer, et al. [(1985) Nature, 315, 680 (for rabbit and porcine embryos)]; Gandolfi, et al. [(1987) 
J. Reprod. Fert. 81, 23-28]; Rexroad, et al. [(1988) J. Anim. Sci. 66, 947-953) (for ovine 
embryos)]; and Eyestone, et al. [(1989) J. Reprod. Fert. 85, 715-720]; Camous et al. [(1984) J. 
Reprod. Fert. 72, 779-785]; and Heyman, et al. [(1987) Theriogenology 27, 5968 (for bovine 

20 embryos)]; the disclosures of each of which are incorporated herein in their entireties. Pre- 
implantation embryos are then transferred to an appropriate female by standard methods to permit 
the birth of a transgenic or chimeric animal, depending upon the stage of development when the 
transgene is introduced. 

Any of a number of methods known in the art can be used to detect the presence of a 

25 transgene in a pre-implantation embryo. 

In a particularly preferred embodiment of the present invention, transgenic mammals are 
generated that secrete recombinant GENSET polypeptides in their milk. As the mammary gland is 
a highly efficient protein-producing organ, such methods can be used to produce protein 
concentrations in the gram per liter range, and often significantly more. Preferably, expression in 

30 the mammary gland is accomplished by operably linking the polynucleotide encoding the 
GENSET polypeptide to a mammary gland specific promoter and, optionally, other regulatory 
elements. Suitable promoters and other elements include, but are not limited to, those derived 
from mammalian short and long WAP, alpha, beta, and kappa, casein, alpha and beta 
lactoglobulin, beta-CN 5' genes, as well as the the mouse mammary tumor virus (MMTV) 

35 promoter. Such promoters and other elements may be derived from any mammal, including, but 
not limited to, cows, goats, sheep, pigs, mice, rabbits, and guinea pigs. Promoter and other 
regulatory sequences, vectors, and other relevant teachings are provided, e.g., by Clark (1998) J 
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Mammary Gland Biol Neoplasia 3:337-50; Jost, et al. (1999) Nat. Biotechnol 17:160-4; U.S. 
Patent Nos. 5,994,616; 6,140,552; 6,013,857; Sohn, et al. (1999) DNA Cell Biol. 18:845-52; Kim, 
et al. (1999) J. Biochem. (Japan) 126:320-5; Soulier, et al. (1999) Euro. J. Biochem. 260:533-9; 
Zhang, et al. (1997) Chin. J. Biotech. 13:271-6; Rijnkels, et al. (1998) Transgen. Res. 7:5-14; 
5 Korhonen, et al. (1997) Euro. J. Biochem. 245:482-9; Uusi-Oukari, et al. (1997) Transgen. Res. 
6:75-84; Hitchin, et al. (1996) Prot. Expr. Purif. 7:247-52; Platenburg, et al. (1994) Transgen. Res. 
3:99-108; Heng-Cherl, et al. (1993) Animal Biotech. 4:89-107; and Christa, et al. (2000) Euro. J. 
Biochem. 267: 1 665 -7 1 ; the entire disclosure of each of which is herein incorporated by reference. 
In another embodiment, the polypeptides of the invention can be produced in milk by 

10 introducing polynucleotides encoding the polypeptides into somatic cells of the mammary gland in 
vivo, e.g. mammary secreting epithelial cells. For example, plasmid DNA can be infused through 
the nipple canal, e.g. in association with DEAE-dextran (see, e.g., Hens, et al. (2000) Biochim. 
Biophys. Acta 1523:161-171), in association with a ligand that can lead to receptor-mediated 
endocytosis of the construct (see, e.g., Sobolev, et al. (1998) 273:7928-33), or in a viral vector such 

15 as a retroviral vector, e.g. the Gibbon ape leukemia virus (see, e.g., Archer, et al. (1994) PNAS 
91:6840-6844). hi any of these embodiments, the polynucleotide may be operably linked to a 
mammary gland specific promoter, as described above, or, alternatively, any strongly expressing 
promoter such as CMV or MoMLV LTR. 

The suitability of any vector, promoter, regulatory element, etc. for use in the present 

20 invention can be assessed beforehand by transfecting cells such as mammary epithelial cells, e.g. 
MacT cells (bovine mammary epithelial cells) or GME cells (goat mammary epithelial cells), in 
vitro and assessing the efficiency of transfection and expression of the transgene in the cells. 

In a preferred embodiment, a retroviral vector such as as Gibbon ape leukemia viral vector 
is used, as described in Archer, et al. ((1994) PNAS 91:6840-6844). As retroviral infection 

25 typically requires cell division, cell division in the mammary glands can be stimulated in 

conjunction with the administration of the vector, e.g. using a factor such as estradiol benzoate, 
progesterone, reserpine, or dexamethasone. Further, retroviral and other methods of infection can 
be facilitated using accessory compounds such as polybrene. Alternatively, an adenoviral or 
adeno-associated viral vector may be used to infect non-dividing cells as discussed herein. 

30 In any of the herein-described methods for obtaining GENSET polypeptides from milk, the 

quantity of milk obtained, and thus the quantity of GENSET polypeptides produced, can be 
enhanced using any standard method of lacation induction, e.g. using hexestrol, estrogen, and/or 
progesterone. 

The polynucleotides used in such embodiments can either encode a full-length GENSET 
35 protein or a GENSET fragment. Typically, the encoded polypeptide will include a signal sequence 
to ensure the secretion of the protein into the milk. 

Recombinant Cell Lines Derived From The Transgenic Animals Of Tlie Invention: 
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A further object of the invention comprises recombinant host cells obtained from a 
transgenic animal described herein. In one embodiment the invention encompasses cells derived 
from non-human host mammals and animals comprising a recombinant vector of the invention or a 
GENSET gene disrupted by homologous recombination with a knock out vector. 

5 Recombinant cell lines may be established in vitro from cells obtained from any tissue of a 
transgenic animal according to the invention, for example by transfection of primary cell cultures 
with vectors expressing o/*c-genes such as SV40 large T antigen, as described by Chou, (1989), 
Mol. Endocrinol. 3: 1511-1514, and Shay et al 9 (1991), Biochem. Biophys. Acta, 1072: 1-7, which 
disclosures are hereby incorporated by reference in their entireties. 

10 Uses of polypeptides of the invention 

Protein of SEQ ID NO:24 (Internal designation Clone 47-14-l-C3-CL0_5) 

The cDNA of clone 47-14-1 -C3-CL0_5 (SEQ ID NO:23) encodes the protein of SEQ ID 
NO:24, comprising the amino acid sequence: 
MWFIYLQAHFTLCSGWSSTYRDLRKGVYW 

15 AAITESDKKTNGSNWEGILGLAYAEIA 
NQSEVIASVGGSNfflGGlDHSLYTGSL^ 
DKSIVDSGTTNLRLPKKVFEAAVKSIKAAS 
SLYLMGEVTNQSFRTTILPQ 
FDRARKRIGFAVSACHXO^ 

20 CALFMLPLCIJMVCQWRCI^(XRQQHDDFADDISLLK. Accordingly, it will be appreciated 
that all characteristics and uses of polypeptides of SEQ ID NO:24 described throughout the present 
application also pertain to the polypeptides encoded by the nucleic acids included in Clone 47-14- 
1-C3-CL0_5. In addition, it will be appreciated that all characteristics and uses of the 
polynucleotides of SEQ ID NO:23 described throughout the present application also pertain to the 

25 nucleic acids included in Clone 47-14-l-C3-CL0_5. A preferred embodiment of the invention is 
directed toward the compositions of SEQ ED NO:23, SEQ ED NO:24, and Clone 47-14-1-C3- 
CL0_5. Also preferred are polypeptide fragments having a biological activity as described herein 
and the polynucleotides encoding the fragments. 
Further preferred are compositions comprising the amino acid sequence: 

30 SPEPFFDSLVKQTHWNLFSLQLCGAGFPLNQSEVLASVGGSMnGGE)HSLY 
RREWYYEVIIVRVEING 
TEKFPDGFWLGEQLVCWQAGTTPWNIF^ 
SQDDCYKFMSQSSTGTVMGAVIMEGFYVVFDRARKM 
VTIJDMEDCGYNffQTDESTLMTIAY; 

35 DIXMDCKEYNYDKSIYDSGTTNLR^ 
Qand 

AITESDKFFINGSNWEGILGLAYAEIA^ 
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QSEVIASVGGSI^GGIDHSLYTGSLWYTPIRREWYYEV 
KSIVDSGTTNLRIJKKVFEAAVKSIK^ 

LYLMGEVTNQSFmiLPQQYLRPVEDVATSQDDCYOAISQSSTGTVM 
DRAKKRIGFAVSACH. Also preferred are polypeptide fragments having a biological activity as 
5 described herein and the polynucleotides encoding the fragments. 

The protein of SEQ ID NO:24 encodes amyloid processing inhibitor protein (APIP ). APIP 
is expressed in mammalian tissues, particularly in neuronal cells, and is an incomplete aspartyl 
protease which is able to bind substrate but lacks catalytic activity. Examples of compounds which 
interact with APIP include, but are not limited to, amyloid beta precursor protein, amyloid 

1 0 precursor like protein-1 , amyloid precursor like protein-2, Protease nexin-2, Anti-trypsin protein, 
Kunitz protease inhibitors and amyloid like proteins. 

Amyloid beta precursor protein (APP) can be processed by several types of proteases to 
yield fragments that are soluble or insoluble (Nunan and Small, FEBS Lett (2000) 483(1):6-10, 
which disclosures are hereby incorporated by reference in their entirety). Sequential cleavage of 

15 APP by beta secretase and gamma secretase yields a secreted and insoluble fibrillar amyloid 
protein, known as beta amyloid, which is the major component of extracellular amyloid plaques. 
Deposition of beta amyloid proteins form intraneuronal neurofibrillary tangles, amyloid plaques 
and vascular amyloid deposits characteristic of both Alzheimer's Disease and aged Down's 
Syndrome. Defects in processing APP can also lead to cerebral hemorraghage. Polypeptides of 

20 SEQ ID NO:24 and fragments thereof, bind to APP and other amyloid like proteins, and reduce the 
rate of processing of these proteins. 

In a number of embodiments, APIP is used to bind to and/or inhibit any of a number of 
substrates in a biological sample. For example, one preferred embodiment is directed to a method 
of contacting compositions comprising APIP with APP. Further preferred is a method of 

25 contacting compositions comprising APIP with amyloid precursor like protein-1 (APLP1). Still 
further preferred is a method of contacting compositions comprising APIP with amyloid precursor 
like protein-2 (APLP2). Such methods are useful, e.g. to inhibit the activity of the substrate such 
as APP, APLP1, or APLP2, or to label the substrate, e.g. by labeling APIP and using it to 
specifically bind to and thus allow the visualization of the substrate or a cell or tissue expressing 

30 the substrate. 

Another embodiment is directed at a method for reducing catabolism of extracellular 
secreted amyloid beta precursor protein (APP) which comprises contacting a mammalian cell with 
APIP. Preferably the said mammalian cell produces APP. The mammalian cell is preferably a 
neuronal cell The mammal is preferably a rodent, canine, or primate. 
35 Another embodiment is directed at a method for reducing catabolism of extracellular 

secreted APLP1 which comprises contacting a mammalian cell with APIP. Preferably the said 
mammalian cell produces APLP1 . The mammalian cell is preferably a neuronal cell. The 
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mammal is preferably a rodent, canine, or primate. 

Another embodiment is directed at a method for reducing catabolism of extracellular 
secreted APLP2 which comprises contacting a mammalian cell with APIP. Preferably the said 
mammalian cell produces APLP2. The mammalian cell is preferably a neuronal cell. The 
5 mammal is preferably a rodent, canine, or primate. 

Amyloid plaques in the brain contribute to disruption of neuronal conductivity which leads 
to disturbances in behavior, perception, memory and mood. Another preferred embodiment of the 
invention is directed to a method of preventing or alleviate mood disorders by contacting 
compositions comprising APIP neuronal cells. Further preferred is a method to prevent or alleviate 
10 schizophrenia by contacting compositions comprising APIP with neuronal cells. Still further 
preferred is a method to prevent or alleviate Alzheimer's disease by contacting compositions 
comprising APIP with neuronal cells. 

Amyloidosis also occurs in the pancreas and may contribute to the development of glucose 
intolerance, insulin insufficiency, or diabetes. A preferred embodiment is directed to a method of 
1 5 preventing of alleviating glucose intolerance by contacting compositions comprising APIP with 
pancreatic cells. Further preferred is a method to prevent or alleviate insulin insufficiency by 
contacting compositions comprising APIP with pancreatic cells. Still further preferred is a method 
to prevent or alleviate diabetes by contacting compositions comprising APIP with pancreatic cells. 

It should be appreciated that preferred compositions of the invention to be used in methods 
20 of the invention described for clone 47-14-l-C3-CL0_5 of SEQ ID NO:23 include polypeptides of 
SEQ ID NO:24 (APIP), and fragments thereof, and compositions comprising the polypeptides of 
SEQ ID NO:24, and fragments thereof. 

Protein of SEQ ED NO:28 (Internal designation Clone 117401_106-006-4-0-Bll-F) 

The cDNA of clone 1 17401_1 06-006-4-0-B1 1-F (SEQ ID NO:27) encodes the protein of 
25 SEQ ID NO:28. Accordingly, it will be appreciated that all characteristics and uses of 

polypeptides of SEQ ID NO:28 described throughout the present application also pertain to the 
polypeptides encoded by the nucleic acids included in Clone 1 1740 l_106-006-4-0-Bl 1-F. In 
addition, it will be appreciated that all characteristics and uses of the polynucleotides of SEQ ID 
NO:27 described throughout the present application also pertain to the nucleic acids included in 
30 Clone 1 17401_106-006-4-0-B 1 1-F. Also preferred are polypeptide fragments having a biological 
activity as described herein and the polynucleotides encoding the fragments/The gene for the 
protein of SEQ ID NO:28 is located on chromosome 8. 

The protein of SEQ ID NO:28 is referred to herein as Frangiopogen. Frangiopogen is 
highly expressed in human fetal liver and lung. It stimulates liver regeneration, has mitogenic 
35 activity and is actively involved in embryonic development. Frangiopogen is involved in complex 
regulatory processes including cell proliferation and angiogenesis. 

In a preferred embodiment of the invention, Frangiopogen is used in tissue treatment 
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compositions to promote wound healing, preferably after injury, such as ischemia, or after surgery, 
including general surgery, ear-, nose- and throat surgery, tissue transplantation, dermal or dental or 
artificial joint transplants, or plastic surgery. Further preferred are uses for Frangiopogen in tissue 
treatment compositions for tissue regeneration. 
5 Preferred tissue treatment compositions of the present invention include physiologically 

acceptable formulations comprising the protein of SEQ ID NO:28. Further preferred are 
physiologically acceptable formulations comprising the protein of SEQ ID NO:28 in combination 
with an additional compound such as any or all of the compounds selected from the group 
consisting of fibrin, fibrinogen, thrombin, factor XHI, calcium chloride, a plasminogen activator, a 

10 plasmin inhibitor (such as aprotin), a growth factor, and a polysaccharide such as hyaluronic acid. 
Still further preferred are formulations comprising the protein of SEQ ID NO:28 alone or in 
compositions, e.g. as described in US Patents 6,083,902 and 5,631,01 1, herein incorporated by 
reference in their entireties. 

In further embodiments, the tissue treatment compositions of the invention are used in 

15 methods of treating injuries comprising the step of contacting a wound or injured tissue with a 
healing or regenerative effective amount (an amount that would increase the rate or progression of 
healing or regeneration as compared to the same wound or injured tissue not treated with a 
composition of the present invention) of a Frangiopogen polypeptide. Further embodiments 
include use of the tissue treatment compositions of the invention for topical application to a site of 

20 injury (e.g. as defined as a site in which the integument is damaged in such a way as to expose the 
dermis), following an accident or following surgery comprising the step of contacting the injured 
tissue with a healing or regenerative effective amount of a Frangiopogen polypeptide. In still 
further embodiments, the tissue treatment compositions of the invention, alone or in combination 
with chondrocytes such as embryonic chondrocytes, are used in methods to treat joint cartilage and 

25 bone defect repair. 

The present invention provides for methods of stimulating proliferation of endothelial cells 
comprising the step of contacting endothelial cells with a proliferative effective amount of a 
Frangiopogen polypeptide of the present invention. Preferably the endothelial cells are vascular 
endothelial cells, arterial or venous. Further preferably, the method results in angiogenesis or the 

30 process of vascularization of a tissue involving the development of new capillary blood vessels. 
Preferably, angiogenesis occurs in a mammal, more preferably the mammal is a dog, cat, horse, 
cow, pig or human. 

In addition, the present invention provides for an antibody that specifically binds a 
Frangiopogen polypeptide of the present invention. The antibody may be monoclonal or 
35 polyclonal. The invention also provides for a method of inhibiting the growth of endothelial cells 
comprising the step of contacting a biological sample comprising endothelial cells with a growth 
inhibiting effective amount of an anti-Frangiopogen antibody. Preferably, the endothelial cells are 
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vascular endothelial cells, arterial or venous. Further preferably, the methods results in the 
inhibition of angiogenesis or blood vessel growth. Further preferably, the inhibition of 
angiogenesis occurs in a mammal, more preferably the mammal is a dog, cat, horse, cow, pig or 
human. 

5 Alternatively, the invention provides for a Frangiopogen polypeptide-cytotoxic agent 

conjugate, whereby the cytotoxic agent is covalently or noncovalently, recombinantly or 
nonrecombinantly, attached or conjugated to a Frangiopogen polypeptide using cytotoxic agents 
and methods well known in the art. The invention also provides for a method of inhibiting the 
growth of endothelial cells comprising the step of contacting a biological sample comprising 

10 endothelial cells or an individual with a growth inhibiting or endothelial cell killing effective 

amount of a Frangiopogen-cytotoxic agent conjugate. Preferably, the endothelial cells are vascular 
endothelial cells. Further preferably, the methods results in the inhibition of angiogenesis or blood 
vessel growth. Further preferably, the inhibition of angiogenesis occurs in a mammal, more 
preferably the mammal is a dog, cat, horse, cow, pig or human. To examine whether a particular 

15 anti -Frangiopogen antibody or a Frangiopogen-cytotoxic agent conjugate is useful to disrupt 
vascular growth or angiogenesis, models well known in the art may be sued, e.g.i the chick 
chorioallantoic membrane assay. 

Preferred polypeptides for use in the methods of the present invention include the polypeptides of 
SEQ ID NO:28 comprising the amino acid sequence: 
20 MRLRAQVRLLETRVKQQQVTGKQLLQENEVQFLDKGDENTVVDLG 
GYKLSGF¥TGKPLQSPAEFSWCT3MSD<^ 

QKHGEYWLGNKM.HFLTTQEDYTLKIDLADFEKNSRYAQYKNFKV 
SGTAGDSLAGNTWEVQWWASHQRMKFSTWDPJ3HDNYEGNCAEEDQSGWWI^aK 
ANLNGVYYSGPYTAKTDNGrVWYTWHGWWYSL^ a polypeptide 

25 comprising the amino acid sequence of: 

MAKWSFILVTTALMGREISALEDCAQEQ 

QFLDKGDEDTV^LGSKRQYAIX:SEIFMX5YKLSGFYKIKPLQSPAEFSVYCDMSDGGG 
WTVIQIUISDGSENFNRGWKDYENGF 
DFEKNSRYAQYKNFKVGDEKNFYELM^ 
30 TWDWDHDNVEGNCAEEDQSGWWFT^ 

WYSLKSVVMKIRPNDFIPNVI; a polypeptide comprising the amino acid sequence of: 

SPISNCEITITDPGKFYNSNSWSRGNMAKW 

QVRLLETRVKQQQVXKQLLQENEVQFLDKG^ 

GFmKPLQSPAFJSWCDMSDGGGWTVIQPJISDGSENFNRGWDYENGFGNFVQKHGEY 
35 WLGNKM^HFLTTQEDYTIJOD 

SLAGNFHPEVQWASHQRMK FSTWDPJDHDNYEGNCAEEDQSGWWFNRCHSANLNGV 
YYSGPYTAKTDNGIVWYTWHGWWY SLKSWMKIR PNDFIPNVI; a polypeptide 
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comprising the amino acid sequence of: 
MAKWSFILVTTALMGR 

QFLDKGDEhnVVDLGSKRQYADCSEIFNDGYKLSGF^ 
WTVIQRRSDGSENFNRGWKDYENGFGNFVQKHGEY 
5 DFEKNSRYAQYKNFKVGDEKNFYELMGEYSGTAGDSLAGNFHPEVQWW 
TWDRDHDNYEGNCAEEDQSGWWFNR^ 

WSLKSVVMKIRPNDFIPNVI; and a polypeptide comprising the amino acid sequence of: 
MKLANWYWLSSAVLATYGFLWAN^ 
VSIJPLTIQLPKQFSRIEEVrTLEV^^ 
10 APGEVGDNRVREIJESEVNKLSSEIJ^ 

LTFVVNSLDGKCSKCPSQEQIQSRPVQHLIYKIX^SDYYAIGK^ 
CDMETMGGGWTVLQARLDGSTNFm 

LRIDLEDFNGVELYALYDQFWANEFLKYRLHVGNYNGTAGD 

PDKDNDRYPSGNCGLYYSSGWWF^^ 
1 5 AHPGGYKSSFKEAKMMIRPKHFKP. Also preferred are polypeptide fragments having a 

biological activity as described herein and the polynucleotides encoding the fragments. 

Proteins of SEQ ID NO:10 (Internal designation Clone 147103 _106-024-l-0-H6-F), SEQ ID 

NO:12 (Internal designation Clone 224168_116-096-3-0-Gll-F), SEQ ID NO:16 (Internal 

designation Clone 225432JL16-083-3-0-C6-F), and SEQ ID NO: 14 (Internal designation 
20 Clone 243303_116-118-4-0-A3-F) 

The polynucleotides of SEQ ID NOs:9, 1 1, 13 and 15 and the polypeptides of SEQ ID 

NOs:10, 12, 14, and 16, respectively, encode the soluble Low density lipoprotein receptor-Related 

Protein-10 (sLRPIO) 

MSASCCLSWCPAKAKSKCGPTFITCASGm 

25 ARYHCKNGLCIDKSFICDGQNNCQDNSDEESCESSQAIFPQITVS. Preferred polynucleotides 
and polypeptides of the invention comprise the nucleic acid sequences of SEQ ID NOs:9, 11,13, 
and 15 and amino acid sequences of SEQ ID NOs: 10, 12, 14, and 16. It will be appreciated that all 
characteristics and uses of the polynucleotides of SEQ ID NOs:9, 1 1, 13, and 15 and polypeptides 
of SEQ ID NOs: 10, 12, 14, and 16 described throughout the present application also pertain to the 

30 human cDNAs of Clones 147103J06-024-1-0-H6-F, 224168_116-096-3-0-GH-F, 243303J16- 
1 1 8-4-0-A3-F, and 225432_1 16-083-3-0-C6-F, and the polypeptides encoded thereby. Preferred 
compositions of the invention include polynucleotides and polypeptides of Clones 147103_106- 
024-1 -0-H6-F, 224168J 16-096-3-0-G1 1-F, 243303_1 16-1 18-4-0-A3-F, and 225432^1 16-083-3- 
0-C6-F; SEQ ID NOs:9, 11, 13, and 15; SEQ ID NOs:10, 12, 14, and 16. Also preferred are 

35 polypeptide fragments having a biological activity as described herein and the polynucleotides 
encoding the fragments. 

sLRPIO is a non-membrane, soluble member of the Low Density Lipoprotein Receptor 



102 



WO 02/094864 



PCT/IB01/01715 



(LDLR) family. This family is characterized by the presence of a number of conserved, cysteine- 
rich LDLR domains. This domain folds to form a defined ligand-binding structure. Most 
members of the LDLR family are transmembrane proteins that function in clathrin-mediated 
endocytosis of various ligands. These ligands are usually then destroyed by lysosomal 
5 degradation. However, shorter, secreted family members have been described (U.S. Patent 
5,496,926 and Quinn, K. et al, Exp. Cell Res. 251: 433-41(1999) which disclosures are hereby 
incorporated by reference in their entirety). The LDLR family of proteins is capable of binding a 
variety of protein and lipoprotein ligands. Furthermore, certain viruses target the LDLR domain to 
gain entry to cells expressing LDLR family members. LDLR proteins are expressed on a variety 
10 of cell types including hepatocytes, neurons, fibroblasts, epithelial, adipose, muscle, and pancreatic 
cells. 

High levels of Low Density Lipoprotein (LDL), Very Low Density Lipoprotein (VLDL), 
chylomicrons, and Apolipoprotein E (ApoE) are associated with atherosclerosis and other 
cholesterol-associated disorders. These molecules are subjects of intense study in the medical 
15 field. As a preferred embodiment, sLRPIO is used to bind LDL, VLDL, chylomicrons, and ApoE. 
While many members of the LDLR family, such as LDLR and alpha-2-macroglobulin receptor, 
are very large (>400 kD) membrane spanning proteins, sLRPIO is relatively small and not 
membrane associated Thus, sLRPIO is an easily purified polypeptide that can be used for binding 
LDLR domain ligands. As a part of this embodiment, sLRPIO polypeptide is covalently or non- 
20 covalently attached to a solid matrix and allowed to bind LDL, VLDL, chylomicrons, or ApoE in 
solution using techniques well known in the art. Once bound, these proteins can be purified using 
the following steps: i) wash the solid matrix to get rid of contaminants, ii) elute the protein of 
interest using more stringent conditions, e.g., increasing salt concentration. 

Additional aspects of this embodiment include methods of detecting and quantifying LDL, 
25 VLDL, chylomicrons, or ApoE bound to sLRPIO using techniques common in the art (e.g., 

Western blotting, ELISA, or use of a labeled secondary detection method) comprising the steps of 
obtaining a biological sample suspected of containing LDL, VLDL, chylomicrons, or ApoE; 
contacting said sample with an LDL, VLDL, chylomicrons, or ApoE binding sLRPIO polypeptide 
of the present invention under conditions suitable for binding of sLRPIO to LDL, VLDL, or ApoE; 
30 detecting the presence or absence of LDL, VLDL, or ApoE by detecting the presence or absence of 
sLRPIO bound to LDL, VLDL, or ApoE. This embodiment is usefid, for example, as a diagnostic 
tool for detecting plasma levels of these proteins. 

In another embodiment of the invention, the sLRPIO polypeptide is used to bind LDL, 
VLDL, chylomicrons, and ApoE in vivo and remove these molecules from the bloodstream. In 
35 this embodiment, the sLRPIO polypeptide may further be expressed as a fusion protein with a 
polypeptide signal specifying excretion from the body. The invention is delivered to individuals at 
risk of atherosclerosis or arterial lipoprotein deposits of LDL, VLDL, chylomicrons, or ApoE as 
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determined by common medical techniques including those described in U.S. Patent 5,652,224, 
incorporated herein by reference in its entirety, and comprising the steps of i) detennining the 
familial predisposition of the individual for these disorders, ii) obtaining a biological sample from 
the individual, and iii) subjecting that sample to analysis for lipoprotein content. Delivery includes 
5 administering an appropriate amount of sLRPIO polypeptide to the bloodstream of the diagnosed 
individual, e.g., by injection. 

ApoE is also associated with the pathogenesis of diabetes. Abnormally high levels of 
ApoE are linked to amyloid plaques and destruction of pancreatic P-cells. Furthermore, ApoE has 
antioxidant activity (Miyata and Smith, Nature Genet 14: 55-61 (1996) which disclosures are 

10 hereby incorporated by reference in their entirety) and oxidative damage destroys P-cells in type 1 
diabetes (Bach J., Endocrin. Rev. 15: 516-542 (1994) and PCT application W09846743, 
incorporated herein by reference in its entirety). This embodiment of the invention could further 
be delivered to patients suffering from or at risk of diabetes to reduce levels of pancreatic ApoE. hi 
this embodiment, the sLRPIO polypeptide may further be expressed as a fusion protein with a 

15 polypeptide signal specifying excretion from the body. An appropriate dosage of sLRPIO may be 
delivered specifically to the bloodstream, by injection for example, or to pancreatic cells using 
methods known in the art including those described in U.S. Patent 5,652,224, incorporated herein 
by reference in its entirety. These include steps comprising i) construction of a recombinant viral 
vector comprising the DNA of, or corresponding to, a portion of the genome of an adenovirus, 

20 which portion is capable of infecting a pancreatic cell, operatively linked to the nucleotide 

sequence of the invention and the regulatory sequences directing its expression; ii) delivery of an 
effective amount of the recombinant adenoviral vector to an individual at risk for diabetes. 

The polypeptide sLRPIO invention can bind ApoE as well as the amyloid precursor protein 
(APP), both of which are associated with the pathogenesis of Alzheimer's disease (Kounnas, M.Z., 

25 et al., Cell 82:331-40 (1995) which disclosures are hereby incorporated by reference in their 
entirety). As a further embodiment of the invention, sLRPIO polypeptide is used to bind these 
proteins in neuronal cell populations to allow study of Alzheimer's pathogenesis. In particular, the 
invention is directly added to a population of neurons to block ApoE activity and study the 
formation of amyloid plaques. 

30 sLRPIO is also able to bind the protooncogene Wnt-1 (Tamai, K., et al., Nature 407:530-35 

(2000) which disclosures are hereby incorporated by reference in their entirety). Wnt-1 usually 
functions as a soluble growth factor that binds to Frizzled receptors but Wnt-1 has also been 
associated with transformation of cells (van Ooyen, A., Cell 39:233-40 (1984) which disclosures 
are hereby incorporated by reference in their entirety). Additionally, Wnt-1 has been associated 

35 with schizophrenia (Shackleford, G., et al., Neuron 1 1 :865-75 (1993) which disclosures are hereby 
incorporated by reference in their entirety), making this protein of particular interest to the 
biomedical community. Another embodiment of the sLRPIO polypeptide invention provides a 
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method to study Wnt-1 and its effects using techniques common to the art. This embodiment 
provides a method of purifying Wnt-1 protein from a biological solution using steps comprising: 
i) attaching sLRPIO to a solid matrix; ii) applying a solution containing Wnt-1; iii) allowing Wnt-1 
to bind to sLRPIO; iv) washing and eluting Wnt-1. Purifying Wnt-1 is useful for a number of 
5 applications, for example to use purified Wnt-1 as a growth factor to administer to cells, to 
generate antibodies against Wnt-1, and others. Additionally, this embodiment of the sLRPIO 
polypeptide is used to bind Wnt-1 in solution and prevent its association with Frizzled receptors, 
thereby preventing molecular signaling events leading to cell growth, proliferation, and/or 
transformation. 

10 sLRPIO binds to viruses comprising the Rous sarcoma, Flaviviridae (including hepatitis 

C), and Rhinovirus (including those responsible for the "common cold") families (Bates, P., et al., 
Cell 93:1043-51 (1993), Agnello, V, et al., PNAS 96:12766-71 (1999), Hofer, F., et al, PNAS 
91 :1 839-42 (1994) which disclosures are hereby incorporated by reference in their entirety). As a 
preferred embodiment of the invention, the sLRPIO polypeptide is used to bind viruses in solution. 

15 This embodiment can be used to detect and quantify virus by techniques common to the art (e.g., 
fluorescent labeling of sLRPIO) comprising steps of obtaining a biological sample suspected of 
containing virus from at least one of the Rous sarcoma, Flaviviridae, or Rhinovirus families; 
contacting said sample with labeled or otherwise detectable sLRPIO polypeptide; and detecting 
and quantifying virus by visualizing the labeled sLRP 1 0. 

20 Membrane spanning LDLR family members are targeted by viruses of the Rous sarcoma, 

Flaviviridae, and Rhinovirus families for entry into cells. However, as sLRPIO is not associated 
with the cellular membrane, it acts to block viral binding to LDLR proteins on the cells that 
express these receptors, thereby preventing infection of those cells. As a preferred embodiment of 
the invention, the sLRPIO protein is used to bind virus and prevent infection of LDLR family- 

25 expressing cells using methods known in the art including U.S. Patent 5,496,926, incorporated 
herein by reference in its entirety. This embodiment may be carried out by steps comprising: 
i) adding the sLPRlO polypeptide directly to cells, e.g. cells that express an LDLR family receptor, 
that may be exposed to a viral sample and ii) preventing the infection of said cells by viruses of the 
Rous sarcoma, Flaviviridae, and Rhinovirus families. 

30 Protein of SEQ ID NO:20 (Internal designation Clone 158523J06-030-2-0-A3-F) 
The cDNA of Clone 158523 J06-030-2-0-A3-F (SEQ ID NO:19) encodes the 
OsteoAngioRemodeling (OAR) protein comprising the amino acid sequence 
MRAWIFFLLCLAGRAIJ\APQQE 
EETEEEWAEWCQNHHCXHGKVCELDENNTPMCV 

35 DSSCHFFATKCTLEGTKKGHKLHLDY^ 

RDEDMsDLLTEKQKLRVKKIHENEKRLEAGDOT (SEQ ID 

NO:20). Accordingly, it will be appreciated that all characteristics and uses of the polypeptides of 
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SEQ ID NO:20 described throughout the present application also pertain to the polypeptides 
encoded by the nucleic acids included in Clone 158523_106-030-2-0-A3-F. In addition, it will be 
appreciated that all characteristics and uses of the polynucleotides of SEQ ID NO: 19 described 
throughout the present application also pertain to the nucleic acids included in Clone 158523_106- 
5 030-2-0-A3-F. A preferred embodiment of the invention is directed toward the compositions 
comprising SEQ ID NO:19, SEQ ID NO:20, or Clone 158523 J06-030-2-0-A3-F. Also preferred 
are polypeptide fragments having a biological activity as described herein and the polynucleotides 
encoding the fragments. Another preferred embodiment of the invention is directed toward 
compositions comprising polypeptide fragments of at least six amino acids within SEQ ID NO:20: 
10 LLARDCQAVSARK, including those having a biological activity described herein, and the 
corresponding polynucleotides. Preferred polypeptides of the present invention include 
polypeptide fragments of SEQ ID NO:20 comprising 

KKIHENEKRLEAGDHPV^ and the corresponding 

polynucleotides. Further preferred polypeptides of the present invention include polypeptide 
1 5 fragements of SEQ ID NO:20 comprising 

DYIGPCKYIPPCLDSELTEFPIJRMRDWLKNVLVTLYEIU^ 

KRLEAGDHPVELLARDCQAVSARKAKIKSEM and the corresponding polynucleotides. 
Polypeptide fragments of SEQ ID NO:20 having a biological activity of those described herein and 
polynucleotides encoding the same are also included in the invention. Biological activities include 

20 increasing bone density when contacted with osteoblasts, tissue remodeling, and wound healing. 

The polypeptides of the OsteoAngioRemodeling (OAR) protein of SEQ ID NO:20 encode 
a carboxy-terminal variant of the human Osteonectin (also SPARC/ BM-40) protein. OAR is 
encoded by the polynucleotides of SEQ ID NO: 19 and represents an alternative splice variant of 
the full-length Osteonectin cDNA. This splice variant is characterized by the presence of an 

25 alternative carboxy-terminal 15 amino acids starting at residue 219 of the 303-amino acid 
Osteonectin protein. 

OAR, like Osteonectin, is a non-collagenous, extracellular matrix-associated protein. 
Expression is found in a number of cell types that include osteoblasts, platelets, and vascular 
epithelia, and is upregulated in sites of proliferation and extracellular matrix (ECM) remodeling. 

30 OAR is a modular protein whose domains mediate structure and protein-protein interactions. OAR 
lacks domain IV of full-length Osteonectin, which contains one of two EF-hand motifs. OAR 
binds molecules such as collagen, PDGF, and FGF. Collagen type binding specificity is in part 
determined by differential N-glycosylation of amino acids 71 and 99. This level of regulation is 
tissue-specific, so that OAR from the bone binds collagens I, HI, and V, yolk sac-derived OAR 

35 binds only m and V, and platelet-derived OAR does not bind collagen at all. Furthermore, binding 
decreases in low pH conditions. OAR plays a role in regulating cell mobility, proliferation, bone 
and tissue remodeling, and metalloproteinase production. OAR is involved in osteoporosis, 
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osteoarthritis, atherosclerosis, angiogenesis, obesity, and metastatic tumors. 

OAR is associated with increased bone density and remodeling. OAR is also associated 
with metalloproteinase production, which is vital for bone remodeling. As a preferred 
embodiment, the OAR polypeptide of the invention is used to increase the activity of osteoblasts 
5 using methods common to the art, for example, by adding a osteoblast-stimulating amount of OAR 
to increase bone production to a culture of osteoblastic cells. This embodiment is applied to 
increase the productivity of osteoblasts for purposes comprising study or replacement therapy. As 
a further embodiment, OAR is used in methods of bone remodeling such as those described in 
Gerber, H., et al. (1999) Nat. Med. 5:623-8, which disclosures are hereby incorporated by 

10 reference in their entirety. For example, OAR is used in a method to promote osteoblast 

differentiation and bone remodeling by inducing metalloproteinase or osteocalcin production by 
contacting OAR with osteoblastic cells in culture. Furthermore, OAR is used in a method to 
promote in vivo osteoblast differentiation by contacting OAR with an area of potential bone 
growth, for instance, in the growth plate of the femur or in the hip which is often the site of 

1 5 fracture. An e ffective amount of OAR is delivered to the site by injection or other methods 
common to the art and effectiveness determined using any suitable method such as X-rays, or 
methods described in Delany, A., et al. (2000) J. Clin. Invest. 105: 915-23, which disclosure is 
hereby incorporated by reference in its entirety. 

Cells derived from certain tissues adhere to specific collagens. OAR binds collagen types 

20 I, HI, and V which are found, for example, in epithelia and bone tissue. This allows OAR to act as 
an anti-adhesion factor by inhibiting normal interaction of collagen in the ECM to cell surface 
adhesion molecules. This activity is associated with cell migration and differentiation. 
Furthermore, OAR is associated with increased metalloproteinase expression, which leads to ECM 
degradation and tissue remodeling. Thus, a preferred embodiment of the invention is directed to a 
: 25 method of using OAR in tissue remodeling, whereby contacting OAR with osteoblasts to inhibit 
binding of collagen to cells allows tissue remodeling. Further preferred is a method to use OAR in 
wound healing (e.g., from surgical damage or chronic conditions such as diabetic ulcers), tissue 
grafts, necrotic or hypoxic tissue in ECM environments comprising collagen types I, HI, and V that 
bind OAR. A method to treat these conditions includes steps comprising: i) identifying the ECM 

30 of the tissue in need of repair as one that binds OAR using methods common in the art (e.g., 
applying fluorescently-labeled OAR to an ECM sample and visualizing by microscopy); 

ii) localizing an effective amount of OAR to the wound area either directly or by injection; 

iii) allowing ECM remodeling to occur as OAR inhibits cell adhesion. 

Osteonectin binds to VEGF, which regulates blood vessel formation. This interaction 
35 prevents VEGF binding to its receptor. The OAR polypeptide lacks a VEGF-binding domain 
while it retains its ability to bind the ECM and affect remodeling (Kupprion, C, et al. (1998) J. 
Biol. Chem. 273:29635-40 which disclosure is hereby incorporated by reference in its entirety). In 
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a preferred embodiment of the invention, OAR polypeptide is used to replace Osteonectin in 
conditions that require VEGF activity in addition to the ECM interactions that mediate wound 
healing and tissue remodeling. This is accomplished in steps comprising: i) obtaining a cell or 
tissue sample in culture that contains at least VEGF and VEGF-responsive cells; ii) adding OAR to 
5 the culture in an amount effective for ECM binding, iii) allowing OAR to enable ECM remodeling 
as well as VEGF signaling to aid in angiogenesis and tissue healing. In addition, expression of 
Osteonectin may be inhibited by introducing EL-1 to the affected area and as described in 
Nakamura, S., et al. (1996) Arthritis Rheum. 39:539-51, which disclosure is hereby incorporated 
by reference in its entirety. As a further embodiment, the invention is applied to the growth and 

10 healing of necrotic or hypoxic tissue, tissue grafts, and bone-associated tissue. The OAR 

polypeptide is delivered to these tissues using methods common to the art such as injection or use 
of OAR polypeptide fused to a targeting molecule specific for the tissue of interest. 

In the extreme, decreased "contact inhibition" from the ECM to the cell surface is linked to 
tumor formation and metastasis. As OAR inhibits contact of cells to specific types of collagen in 

15 the ECM, OAR is involved in metastasis of a number of tumor cell types including breast and 
prostate carcinomas. In a preferred embodiment of the invention, the OAR polypeptide is used to 
develop inhibitors of its collagen-binding activity to prevent ECM invasion. This invasion 
includes the proliferation of cells into inappropriate tissues, such as that observed in rheumatoid 
arthritis and cancers including breast and prostate carcinomas. Inhibitors of OAR are comprised of 

20 antibodies raised against the carboxy-terminal 1 5 amino acids of the OAR polypeptide and small 
molecules that interfere with OAR collagen binding activity. OAR binding to ECM environments 
is determined using methods common to the art such as applying fluorescently-labeled OAR to a 
tissue sample and visualizing by microscopy. Effectiveness of OAR inhibitors is determined using 
the aforementioned method or by observing cell invasion of the ECM as described by Kato, Y., et 

25 al. (1998-99) Invasion Metastasis 18:105-147, which disclosure is hereby incorporated by 

reference in its entirety. An example use of this embodiment would include methods comprising 
the steps: i) purifying the OAR inhibitor such as an antibody using methods common in the art 
(e.g.- affinity chromatography); ii) determining a site of inappropriate ECM invasion using 
methods common to the art such as tissue imaging, X-ray, or palpation; iii) localizing an effective 

30 amount of OAR inhibitor to the site to allow cell surface-collagen interactions and prevent ECM 
invasion. Localization of the OAR inhibitor is effected using methods common in the art such as 
injection. Further included in the invention is a method for delivering the OAR polypeptide fused 
to a targeting molecule specific for the tissue of interest. 

OAR binds to growth factors including PDGF, which can induce cell migration and proliferation, 
35 and inhibits binding of the growth factor to its receptor under certain conditions. As a preferred 
embodiment of the invention, the OAR polypeptide is used to inhibit signaling through growth 
factor receptors such as the PDGF receptor. This embodiment is useful in preventing inappropriate 
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growth of PDGF-responsive cells, such as dermal fibroblasts (e.g., in the case of hypertrophic 
scars) and platelets (e.g., in cases of malignant lymphomas). This embodiment is carried out, for 
instance to reduce the volume of a hypertrophic scar, by identifying a region with excess scar 
tissue using methods described by Nedelec, B., et al. (2000) J. Bum Care Rehabil. 21 :205-12, 
5 which disclosure is hereby incorporated by reference in its entirety; administering an effective 
amount of OAR to the scar directly or by injection; and monitoring the scar using aforementioned 
method or others common to the art. 

Protein of SEQIDNO:30 (Internal designation Clone 133431_105-092-4-0-Gll-F) 

The cDNA of clone 1 3343 1_1 05-092^-0-Gl 1-F (SEQ ID NO:29) encodes a variant of the ALEX- 
10 1 protein with the amino acid sequence 

MGRTREAGCV AAGWIGAGACY CVYRLAWGRDENEKIWDEDEESTDTSXIGVETVKGA 
KTNAGAGSGAKiQGDSEVKPEVSLGLEDCPGV^ 

QASAKAGKGARVGTISGNRTLAPSLPCPGGRGGGCHPTRSGSRAGGRASGK 
TRAPATHVPVRRGKFNFPYKJDDILSAPDL 

15 NQNAIRELGGWIIAKKKKK (SEQ ED NO:30). It will be appreciated that all characteristics and 
uses of polypeptides of SEQ ID NO:30 described throughout the present application also pertain to 
the polypeptides encoded by the nucleic acids included in Clone 13343 l_105-092-4-0-Gl 1-F. In 
addition, it will be appreciated that all characteristics and uses of the polynucleotides of SEQ ID 
NO:29 described throughout the present application also pertain to the nucleic acids included in 

20 Clone 133431_105-092-4-0-Gll-F. Apreferred embodiment of the invention is directed toward 
the compositions of SEQ ID NO:29, SEQ ID NO:30, and Clone 13343 1 J 05-092-4-0-G1 1-F. 
Also preferred are polypeptide fragments having a biological activity as described herein and the 
polynucleotides encoding the fragments. The gene of SEQ ID NO:29 is located on the X- 
chromosome. It encodes a new armadillo repeat protein with a death effector domain and is 

25 involved in cell-cell adhesion, cell signaling and apoptotic processes and is hereby referred to as 
Armapoptin. 

Armapoptin promotes cell growth and differentiation during embryonic development. It is 
part of multi-protein complexes, which mediate cell-cell adhesion, anchorage to the actin 
cytoskeleton with adjacent cells, and a signal in response to cell adhesion to initiate cell polarity 

30 and the formation of epithelia. Armapoptin complexes, which include E-cadherin and different 
cadherin-binding proteins including P-catenin can also be associated with a tumor suppressor 
protein such as Adenomatous Polyposis Coli (APC), which is mutated in hereditary colon cancer. 
Cell-cell adhesion in normal differentiation processes and malignant proliferation is mediated by 
the armadillo domain serving as a scaffold for the assembly of multi-protein complexes. 

35 The N-terminal region of Armapoptin contains a death effector domain (DED) comprising 

residues RLAWGRDENEKIWDEDEES. Death effector domains are involved in caspase- 
dependent apoptotic processes. Armapoptin is expressed in most tissues, but is not expressed or 
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significantly underexpressed in breast carcinoma biopsies of patients as well as in epithelial based- 
tumor cell lines including ovarian carcinoma, cervix adenocarcinoma cells, lung carcinomas, and 
immortalized endothelial cell lines such as t-HUE2. 

In an embodiment, Armapoptin polynucleotides are used in a method of gene therapies to 
5 restore cell-cell adhesion and to promote caspase-dependent apoptosis, preferably in epithelial cell- 
based tumors including breast carcinoma, ovarian carcinoma, lung carcinoma, non-small cell lung 
carcinoma (NSCLC), and squamous cell carcinoma of head and neck (SCCHN). Preferred 
compositions of Armapoptin to be used in methods of gene therapy, further referred to as "gene 
therapy compositions of Armapoptin" are compositions comprising the full-length DNA, SEQ ID 

10 NO:29, or fragments thereof, encoding a polypeptide or fragments thereof, including the sequences 
aatcctagtcttcgtttggtccggttgcactcttcctatagcccagagggcgagagggcctgtggcctgggggaaggaggacgaggttctgcc^ 
ggatcccagcaggacgctgtgccatttgggaacaaaggaatagtctgcctggaatccctgcagatcttggggccggaggccagtccaaccct 
tggagcaggaagaaacgcaaagttgtcaagaaccaagtcgagctgcctcagagccggcccgcagtagctgcagactccgcccgcgacgtg 
tgcgcgcttctctgggccagagcgagrctgtttt^ 

15 ctggtgcctgctactgtgtatacagactggcttggggaagagacgagaacgagaaaatctgggacgaagacgaggagtctacggacacctc 
akagattggggttgagactgtgaaaggagctaaaactaacgctggggcagggtctggggccaaacttcagggtgattcagaggtcaagcctg 
aggtgagtttgggactcgaggattgtccgggtgtaaaagagaaggcccattcaggatcccacagcggaggtggcctagaggccaaggccaa 
ggcccttttcaacacgctgaaggaacaggcaagtgcaaaggcaggcaaaggggctagggtgggtaccatctctgggaacaggacccttgca 
ccgagtttaccctgcccaggaggcaggggtggaggctgccaccccaccaggagtggatctagggccgggggcagggcaagtggaaaatc 

20 • caagggaaaggcccgaagtaagagcaccagggctccagctacaacatggcctgtrc^ 
tattctgagtgctcccgacctccaaaaggfcctcaacate 
aatgcagcatattcatttaaccagaatgccatacgtgaatt 
or 

tctgagtacc agctccccac tgccctgagg gcgggccggc ctgcggcgga gggaaaaaggaagaggagaa ggaaattgtc 
25 ccgaatccct gcagtgggtc caagcctctc ccgggtggccagtctttctg taggttgcgg cacaacgcca ggcaaaagaa 

gaggaaggaa tttaatcctaatcggtggag gtcgatttga gggtctgctg tagcaggtgg ctccgcttga agcgagggaggaagtttcct 
ccgatcagta gagattggaa agattgttgg gagtggcacaccactagggaaaagaagaag gggcgaactg cttgtcttga 
ggaggtcaac ccccacaatc agctcttgtggccttgaagt ggctgaagac gatcaccctc cacaggcttg agcccagtcc 
cacagccttcctcccccagc ctgagtgact actctattcc ttggtccctg ctattgtcgg ggacgattgcatgggctacg ccaggaaagt 
30 aggctgggtg accgcaggcc tggtgattgg ggctggcgcctgctattgca tttatagact gactagggga agaaaacaga 
acaaggaaaa aatggctgagggtggatctg gggatgtgga tgatgctggg gactgttctg gggccaggta 
taatgactggtctgatgatg atgatgacag caatgagagc aagagtatag tatggtaccc accttgggctcggattggga ctgaagctgg 
aaccagagct agggccaggg caagggccag ggctacccgggcacgtcggg ctgtccagaa acgggcttcc cccaattcag 
atgataccgt tttgtcccctcaagagctac aaaaggttct ttgcttggtt gagatgtctg aaaagcctta tattcttgaagcagctttaa 
35 ttgctctggg taacaatgct gcttatgcat ttaacagaga tattattcgtgatctgggtg gtctcccaat tgtcgcaaag attctcaata 
ctcgggatcc catagttaaggaaaaggctt taattgtcct gaataacttg agtgtgaatg ctgaaaatca gcgcaggcttaaagtataca 
tgaatcaagt gtgtgatgac acaatcactt ctcgcttgaa ctcatctgtgcagcttgctg gactgagatt gcttacaaat atgactgtta 
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ctaatgagta tcagcacatgcttgctaatt ccatttctga cttttttcgt ttattttcag cgggaaatga agaaaccaaacttcaggttc 
tgaaactcct tttgaatttg gctgaaaatc cagccatgac tagggaactgctcagggccc aagtaccatc ttcactgggc tccctcttta 
ataagaaaga gaacaaagaagttattctta aacttctggt catatttgag aacataaatg ataatttcaa atgggaagaaaatgaaccta 
ctcagaatca attcggtgaa ggttcacttt ttttcttttt aaaagaatttcaagtgtgtg ctgataaggt tctgggaata gaaagtcacc 
5 atgatttttt ggtgaaagtaaaagttggaa aattcatggc caaacttgct gaacatatgt tcccaaagag ccaggaataacaccttgatt 
ttgtaattta gaagcaacac acattgtaaa ctattcattt tctccaccttgtttatatgg taaaggaatc ctttcagctg ccagttttga 
ataatgaata tcatattgtatcatcaatgc tgatatttaa ctgagttggt ctttaggttt aagatggata aatgaatatcactacttgtt 
ctgaaaacat gtttgttgct ttttatctcg ctgcctagat tgaaatattttgctatttct tctgcataag tgacagtgaa ccaattcatc 
atgagtaagc tcccttctgtcattttcatt gatttaattt gtgtatcatc aataaaattg tatgttaatg ctggaagggaaaaaaaaaaa 

1 0 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaa. 

Further preferred are compositions comprising PCR-based subcloning of the gene therapy 
compositions of Armapoptin into plasmid vectors such as pCMVp or pSVp, tissue-specific 
• promoter-containing plasmids such as the MUC1 promoter, which allows epithelial cell specific 
expression and is up-regulated during malignancy, and the P450arom promoter II for breast 

15 carcinomas employing liposomal delivery systems by methods described in Patel, US Patent 
6,225,090, 2001, Thierry, US Patent 6,1 10,490, 2000; Wolff, et al., US Patent 6,228,844, 2001, 
Graham, et al., IntJ.Cancer 92:382-387, 2001, Zhou, et al, Cancer Res. 61:2328-2334, 2001, which 
disclosures are hereby incorporated in their entireties. Further preferred are compositions 
comprising polynucleotides of the invention cloned into adenoviral vectors (Beach, et al., US 

20 Patents 5,968,821, 1999, and 6,21 1,334, 2001; Mehtali, et al., US Patent 6,204,060, 2001), and 
MoMLV-based retroviral vectors for gene delivery into dividing cells, i.e. tumor tissues according 
to methods described by Holt, et al., US Patent 6,177,410, 2001, which disclosures are hereby 
incorporated in their entirety. 

Methods to deliver preferred compositions of Armapoptin polynucleotides and fragments 

25 thereof, comprise local injection of preferred compositions of the invention into tumor tissue or 
surrounding vessels, or ex vivo therapy. Further methods comprise tumor tissue specific targeting 
of Armapoptin polynucleotides or fragments thereof in a plasmid via antibodies or other ligands, 
which recognize tumor-specific receptors. These ligands will be covalently linked to polycations 
such as poly-L-lysine or liposomes, and complexed with preferred gene therapy compositions of 

30 Armapoptin. Preferred tumor cell types to be used in methods of gene therapy include breast 
carcinoma, cervix adenocarcinoma, ovarian carcinoma, lung carcinoma, and squamous cell 
carcinoma of head and neck derived from mammalian cells including rodent and human. 
Assessment of therapeutic efficacies will include tumor regression following delivery of preferred 
gene therapy compositions of Armapoptin as monitored by measurement of tumor circumference. 

35 Apoptosis will be measured by morphological assessments including retraction of cytoplasmic 
extension, cell rounding and detachment, and via MTT assays, which measure mitochondrial 
function for viability, cell death and caspase activity, and DNA fragmentation analysis as described 
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by Noteborn, et al. US Patent 5,981,502, 1999; Boone, et al., IBiol.Chem. 275:37596-37603, 
2000; Shibata, et al., Cancer Gene Therapy. 8:23-35, 2001; Lacour, et al., Cancer Research 
61:1645-1651, 2001), which disclosures are hereby incorporated by reference in their entireties. 
Further embodiments include putative death effector domains for therapeutic use in 

5 caspase-dependent cell death including incubation of carcinoma cells with compositions 

comprising polypeptides of preferred sequences comprising RLAWGRDENEKIWDEDEES and 
FADD DED-related domains as described in Eberstadt, et al., Nature.392:941-945, 1998, and 
Hackam, et al., J.Biol.Chem.275:41299-41308, 2000, which disclosures are hereby incorporated by 
reference in their entireties, with the consensus sequence 

10 SSYRVLLLLISEELDSEELEVLL^ 
LLYRLRRLDLLRRLFG. 

Further, these DED domain-encoding sequences will be subcloned into expression vectors 
and used for cell transfections and apoptosis studies as described above. 

In another embodiment, Annapoptin polypeptides or fragments thereof will be used as 

15 immunotherapeutics by covalent or noncovalent linkage to a cell-specific (e.g. tumor cell-specific) 
antibody, or to a ligand which is recognized by a tumor cell-specific receptor and internalized. 
Receptors which are abundantly expressed on tumor cells but not on intact, quiescent tissues to be 
employed in the present invention include HI 1 [(C-antigen); Dan, et al., US Patent 6,207,153, 
2001], tyrosine growth factor receptors including erbB-2 (HER-2-neu) (Suzuki, et al., Biochim 

20 Biophys Acta.l525:191-196, 2001; Kumar, et al., Semin Oncol.27:84-91, 2000; Lango, et al., 
Current Opin Oncol. 13: 168-175, 2001), the folate receptor (Ward, Current Opin Mol Ther.2:182- 
187, 2000), human epidermal growth factor receptor (Schlessinger, et al., US Patent 6,217,866, 
2001), and endoglin on endothelial cells for tumor vascular targeting (Seon, US Patent 6,200,566, 
200 1 ), which disclosures are hereby incorporated by reference in their entirety. 

25 The death effector domain causes neuronal cell death in Huntington's disease (Hackam, et 

al., J.Biol.Chem. 275:41299-41308, 2000, which disclosures are hereby incorporated by reference 
in their entirety) by stronger association with the mutant, glutamine rich protein, which causes the 
disease as opposed to wild-type huntingtin in healthy individuals. Another embodiment uses 
Armapoptin and ALEX-1, partial sequences thereof including the death effector domain 

30. RLAWGRDENEKIWDEDEES, and the death effector domain of the huntmgtm-interacting 
protein (HIP-1), conserved among related sequences with the consensus peptide 
SSYRVLLLLKEELDSEELEVUJ^ 

LLYRLRRLDLLRRLFG for competitive binding studies with wild-type huntingtin and the 
disease-causing mutant. By contacting polypeptides of the invention with wt- and mt- (glutamine- 
35 rich) huntingtin, peptide-protein interactions will be analyzed by biophysical methods and 

validated using the following steps as described in Scalley, et al., Biochemistry. 38:15927-15935, 
1999; Chaillan-Huntington et al., J Biol Chem. 275:5874-5879, 2000; Lohner et al., Biochim 
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Biophys Acta. 1462: 14 1-156, 1999; Eberstadt, et al., Nature 392:941-945, 1998, which disclosures 
are hereby incorporated in their entireties. 

Structural transitions in the denatured state ensemble, fluorescence energy transfer, and 
determination of peptide conformation and structural characteristics using circular dichroism 
5 Isothermal titration calorimetry, fluorescence binding assays, and differential scanning calorimetry 
to determine comparative values, and strength of interactions. 

Structure determination of polypeptide/htmtingtin complexes by NMR and X-ray 
crystallography. Co-incubation of cell lines like 293 T cells with protein-peptide complexes, and 
co-transfection of cells with wt- and mt-huntingtin- encoding plasmids and cloned oligonucleotides 

10 for cytotoxicity assays as well known in the art. 

Another embodiment includes the method to use armadillo repeats of armapoptin, including 
NFPYKIDDILSAro and 
YSFNQNAIRELGGWHAKL^ as single repeats, and naturally 

occurring tandem array repeats 

15 NFPYKTODILSAPDLQKV^ 

IKTKDPIIREKTYNALNNLSV for the restoration of cell-cell adhesion in treatment or prevention 
of cancer or other diseases or disorders where restoration of cell-cell adhesion is sought, wherein 
said method includes contacting cells in need of cell-cell adhesion with either monomers or 
concatamerized forms, either recombinantly or nonrecombinantly, such as dimmers, trimers, or 

20 longer repeats, in a cell-cell adhesion restorative amount of an Armapoptin polypeptide of the 
present invention. 

Protein of SEQ ID NO:26 (Internal designation Clone 545542_182-l-2-0-D12-F) 

The cDNA of clone 545542J 82-1 -2-0-D12-F (SEQ ID NO:25) encodes the 251 amino acid 
human Fibroblast Growth Factor-22 protein (FGF-22) comprising the amino acid sequence: 
25 MLGARLRLWVCALCSV^^ 
HXHDGAPHQTIYSALMIRSEDA 

LENGYDVYHSPQYHFLVSLGRAKRAFLPGMNPPPYSQFL^ 

EDDSERDPLNVIJCPRARMTPAPASCSQELPSAEDNSPMASDPLGVW 

EGCRPFAKFI (SEQ ID NO:26). Accordingly, it will be appreciated that all characteristics and 

30 uses of the polypeptides of SEQ ID NO:26 described throughout the present application also 
pertain to the polypeptides encoded by the nucleic acids included in Clone 545542_1 82-1-2-0- 
D12-F. In addition, it will be appreciated that all characteristics and uses of the polynucleotides of 
SEQ ID NO:25 described throughout the present application also pertain to the nucleic acids 
included in Clone 545542_182-l-2-0-D12-F. A preferred embodiment of the invention is directed 

35 toward the compositions of SEQ ID NO:25, SEQ ID NO:26, and Clone 545542 J 82-1-2-0-D12-F. 
Also preferred are polypeptide fragments having a biological activity as described herein and the 
polynucleotides encoding the fragments. 
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FGFs exert their biological effects through interaction with cognate single transmembrane, 
heparin-binding, fibroblast growth factor receptors (FGFR) with intrinsic kinase activity, 
designated fibroblast growth factor receptor 1 (FGFR-1), fibroblast growth factor receptor 2 
(FGFR-2), fibroblast growth factor receptor 3 (FGFR-3) and fibroblast growth factor receptor 4 
5 (FGFR-4). Physiologically, FGFs bind heparin sulfate proteoglycans which are sulfated 

glycosaminoglycans covalently bound to core protein. The ability to bind heparin : like moieties 
includes FGFs within the more encompassing Heparin Binding Growth Factor (HBGF) 
superfamily of peptide growth factors. Additionally, FGFs bind the cysteine-rich FGF-R (CFR), 
an integral single transmembrane protein in a mutually exclusive manner with respect to the other 
10 FGFRs. 

FGF-22 exibits a pattern of temporal and spatial expression in the embryonic and adult 
organism most pronounced in the brain, including but not limited to the ventrolateral thalamic 
nucleus and thalamus. FGF-22 is directly associated with the inherited disorder Autosomal 
Dominant Hypophosphatemic Rickets (ADHR), represented by missense mutations in FGF-22 

1 5 polypeptide residues ARG1 76GLN and ARG1 79TRP of SEQ ID 26, respectively, resulting from 
FGF-22 nucleotide transitions at position G527A and C535T, respectively of SEQ ID 25. 

Included as an embodiment of the present invention is a method of elevating serum 
phosphate levels to within physiologically acceptable concentrations comprising the step of 
contacting kidney tissue or cells, in vitro or in vivo, with an effective amount of a FGF-22 

20 polypeptide. The polypeptide of the present invention may be employed in combination with a 
suitable physiologically acceptable carrier to comprise a physiologically acceptable composition 
for administration. Such compositions comprise a therapeutically effective amount of the FGF-22 
polypeptide and a physiologically acceptable carrier or excipient. Such a carrier includes but is not 
limited to saline, buffered saline, dextrose, water, glycerol, ethanol, and combinations thereof. The 

25 formulation should suit the mode of administration. Preferably, the kidney cells are nephron renal 
tubules and associated vascular components (collectively designated the glomerular capsule) 
capable of altering tubular reabsorption, and/or distal or collecting tubules. Preferably; the kidney 
tissue or cell is contacted by administering a FGF-22 polypeptide to an individual. As used herein, 
the term "individual" includes members of the animal kingdom including but not limited to human 

30 beings. Preferably, the FGF-22 polypeptide is administered parenterally, more preferably 
intraperitoneal. 

Further included in the present invention is a method of attenuating osteomalacia or tumor- 
induced osteomalacia comprising contacting osseous tissue (osteocytes, osteoblasts, osteoclasts) 
with an osteomalacia inhibiting effective amount of a FGF-22 polypeptide. The polypeptide of the 
35 present invention may be employed in combination with a suitable physiologically acceptable 
carrier to comprise a physiologically acceptable composition for administration. Such 
compositions comprise a therapeutically effective amount of the FGF-22 polypeptide and a 
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physiologically acceptable earner or excipient. Such a carrier includes but is not limited to saline, 
buffered saline, dextrose, water, glycerol, ethanol, and combinations thereof. The formulation 
should suit the mode of administration. Preferably, the osseus tissue or cell is contacted by 
administering a FGF-22 polypeptide to an individual. Preferably, the FGF-22 polypeptide is 

5 administered parenterally, more preferably intraperitoneal. 

In another embodiment of the present invention is a method of attenuating osteopenia 
comprising contacting osseus tissue (osteocytes, osteoblasts, osteoclasts) with an osteopenia 
inhibiting effective amount of a FGF-22 polypeptide. The polypeptide of the present invention 
may be employed in combination with a suitable physiologically acceptable carrier to comprise a 

10 physiologically acceptable composition for administration. Such compositions comprise a 
therapeutically effective amount of the FGF-22 polypeptide and a physiologically acceptablely 
acceptable carrier or excipient. Such a carrier includes but is not limited to saline, buffered saline, 
dextrose, water, glycerol, ethanol, and combinations thereof. The formulation should suit the 
mode of administration. Preferably, the osseous tissue is contacted by administering a FGF-22 

15 polypeptide to an individual. Preferably, the FGF-22 polypeptide is administered parenterally, 
more preferably intraperitoneal. 

In another embodiment of the present invention is a method of attenuating osseous bone 
matrix deposition, including defects associated with congenital malformations, osteogenesis 
imperfecta (types I-IV), osteoporosis (type I and/or type II), rickets, fracture remodeling, surgical 

20 repair and restoration, and associated with deficiencies in osteoid mineralization or deposition, 
comprising contacting osseous tissue (osteocytes, osteoblasts, osteoclasts) with a osteoid 
deposition or osteoid mineralization stimulating effective amount of a FGF-22 polypeptide. The 
polypeptide of the present invention may be employed in combination with a suitable 
physiologically acceptable carrier to comprise a physiologically acceptable composition for 

25 administration. Such compositions comprise a therapeutically effective amount of the FGF-22 
polypeptide and a physiologically acceptablely acceptable carrier or excipient. Such a carrier 
includes but is not limited to saline, buffered saline, dextrose, water, glycerol, ethanol, and 
combinations thereof. The formulation should suit the mode of administration. Preferably, the 
osseous tissue is contacted by administering a FGF-22 polypeptide to an individual. Preferably, 

30 the FGF-22 polypeptide is administered parenterally, more preferably intraperitoneal. 

In another embodiment of the present invention is a method of attenuating bone resorption 
or jaw atropy associated with dental abscess (periapical or periodontal) formation or progression, 
congenital or derived edentulous conditions, or consequent to elective dental extraction, 
comprising contacting oral cavity osseous tissue (osteocytes, osteoblasts, osteoclasts) of the 

35 mandible or maxilla, preferably located adjacent to the sulcular groove region, with an effective 
amount of an FGF-22 polypeptide. The polypeptide of the present invention may be employed in 
combination with a suitable physiologically acceptable carrier to comprise a physiologically 
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acceptable composition for administration. Such compositions comprise a therapeutically effective 
amount of the FGF-22 polypeptide and a physiologically acceptablely acceptable carrier or 
excipient. Such a carrier includes but is not limited to saline, buffered saline, dextrose, water, 
glycerol, ethanol, and combinations thereof. The formulation should suit the mode of 
5 administration. Preferably, the osseous tissue is contacted by administering a FGF-22 polypeptide 
to an individual. Preferably, the FGF-22 polypeptide is administered parenterally, by any 
convenient manner, typically by syringe or catheter at the location of targeted osteosynthesis. 
In a further embodiment of this invention is a method of facilitating osseointegration of dental 
implant prostheses comprising contacting oral cavity osseous tissue (osteocytes, osteoblasts, 

10 osteoclasts) of the maxilla and/or mandible as well as osseous tissue i.e. autogeneic or allogeneic 
bone graft, or dental biomaterial matrix i.e. coral or hydroxyapatite, incorporated within the dental 
implant device, or bioabsorbable cement in peri-implant region with an effective amount of an 
FGF-22 polypeptide. Following tooth extraction, implant osteotomies were prepared and FGF-22 
polypeptide included with a bioabsorbable cement placed circumferentially within the osteotomies. 

1 5 Implant prostheses were placed into the prepared sites including the FGF-22 dental cement (Meraw 
et al., (J Periodontol 71: 8-13, 2000)). Preferably, the osseous tissue is contacted by administering 
a FGF-22 polypeptide to an individual. The polypeptide of the present invention may be employed 
in combination with a suitable physiologically acceptable carrier to comprise a physiologically 
acceptable composition for administration. Such compositions comprise a therapeutically effective 

20 amount ofthe FGF-22 polypeptide and a physiologically acceptable carrier or excipient Such a 
carrier includes but is not limited to saline, buffered saline, dextrose, water, glycerol, ethanol, and 
combinations thereof. The formulation should suit the mode of administration. Preferably, the 
FGF-22 polypeptide is administered parenterally, by any convenient manner, typically by syringe or 
catheter at the location of targeted osteosynthesis. FGF-22 polypeptide is alternatively or 

25 additionally administered directly associated with the biodegradable matrix of the dental implant 
using methods of Uli (US Patent 6,214,008/PCT W098/46289), Gayer and Comfort (US Patent 
6,214,049), and/or associated with the bioabsorbable cement using the methods of Meraw, et al. (J 
Periodontol 71: 8-13, 2000), which disclosures are hereby incorporated by reference in their 
entireties. 

30 A further embodiment of the current invention is a method of facilitating osteosynthesis of 

bone to attenuate acetablular erosion or osteonecrosis of the femoral head in advance of orthopedic 
osseointegration of hip joint implant prostheses for hip arthroplasty comprising contacting implant 
localized osseous tissue (osteocytes, osteoblasts, osteoclasts) of the hip joint, preferably the 
acetabular region and/or femoral head, with a stimulating effective amount of a FGF-22 

35 polypeptide. The polypeptide of the present invention may be employed in combination with a 
suitable physiologically acceptable carrier to comprise a physiologically acceptable composition 
for administration. Such compositions comprise a therapeutically effective amount of the FGF-22 
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polypeptide and a physiologically acceptable carrier or excipient. Such a carrier includes but is not 
limited to saline, buffered saline, dextrose, water, glycerol, ethanol, and combinations thereof. The 
formulation should suit the mode of administration. Preferably, the osseous tissue is contacted by 
administering a FGF-22 polypeptide to an individual. Preferably, the FGF-22 polypeptide is 
5 administered parenterally, preferably intraperitoneal, or by any convenient manner, or by syringe or 
catheter at the location of targeted osteosynthesis. FGF-22 polypeptide is additionally administered 
by incorporation with the biodegradable matrix of the prosthetic joint implant. 

An additional embodiment of this invention is a method of facilitating osteosynthesis of 
bone to attenuate articular surface erosion or osteonecrosis of the femur and/or tibia and/or patella 

10 in advance of orthopedic osseointegration of knee joint implant prostheses for knee joint 
arthroplasty, or osteochondral fracture repair, or the placement of orthopedic pins or screws, 
comprising contacting implant localized osseous tissue (osteocytes, osteoblasts, osteoclasts) of the 
knee joint, preferably the articular surfaces, with a stimulating effective amount of a FGF-22 
polypeptide. The polypeptide of the present invention may be employed in combination with a 

1 5 suitable physiologically acceptable carrier to comprise a physiologically acceptable composition 
for administration. Such compositions comprise a therapeutically effective amount of the FGF-22 
polypeptide and a physiologically acceptable carrier or excipient. Such a carrier includes but is not 
limited to saline, buffered saline, dextrose, water, glycerol, ethanol, and combinations thereof. The 
formulation should suit the mode of administration. Preferably, the osseous tissue is contacted by 

20 administering a FGF-22 polypeptide to an individual. Preferably, the FGF-22 polypeptide is 
administered parenterally, preferably intraperitoneal, or by any convenient manner, typically by 
syringe or catheter at the location of targeted osteosynthesis. FGF-22 polypeptide is additionally 
administered by incorporation with the biodegradable matrix of the prosthetic joint implant 
FGF-22 is a potent inducer of epithelial cell proliferation. Therefore, another embodiment of this 

25 invention is a method of stimulating epithelial cell proliferation or increasing epithelial cell 
viability by contacting said cells, in vitro or in vivo, with a proliferative stimulating or viability 
increasing effective amount of a FGF-22 polypeptide. More specifically a method of promoting 
wound repair or tissue healing, such as resultant from burn, ulcer (e.g., venous ulcers in diabetics), 
aging, post-operative damage, disease, or other insult, by stimulating epithelial cell proliferation or 

30 increasing epithelial cell viability by contacting said cells or tissue, in vitro or in vivo, with a 
proliferation stimulating or viability increasing effective amount of a FGF-22 polypeptide. The 
polypeptide of the present invention may be employed in combination with a suitable 
physiologically acceptable carrier to comprise a physiologically acceptable composition for 
administration. Such compositions comprise a therapeutically effective amount of the FGF-22 

35 polypeptide and a physiologically acceptable carrier or excipient. Such a carrier includes but is not 
limited to saline, buffered saline, dextrose, water, glycerol, ethanol, and combinations thereof. The 
formulation should suit the mode of administration. Preferably, the epithelial tissue is contacted by 
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administering a FGF-22 polypeptide to an individual. Preferably, the FGF-22 polypeptide is 
administered parenterally, preferably intraperitoneal, or by any convenient manner, typically by 
syringe or catheter directly at the location of targeted epithelial proliferation. 

FGF-22 is a potent regulator of connective tissue proliferation, including embryonic 
5 mesechymal cells, fibrobastic cells of areolar, collagenous and elastic connective tissue, 

chondrocytes of cartilage and osteocytes of bone. Therefore, another embodiment of this invention 
is a method of stimulating fibroblast cell proliferation or increasing fibroblast cell viability by 
contacting said cells, in vitro or in vivo, with a proliferative stimulating or viability increasing 
effective amount of a FGF-22 polypeptide. A further specified embodiment of the present 

10 invention is a method of promoting wound repair or tissue healing, in vitro and in vivo, such as 
resultant from burn, ulcer, aging, post-operative damage such as tendon and ligament repair (Chan, 
et al., Acta Orthop Scand 71:513-518, 2000; Kuroda, et al., Knee Surg Sports Traumatol Arthrosc 
8: 120-126, 2000), disease, or other insult, by stimulating connective tissue cell proliferation or 
increasing connective tissue cell viability by contacting said cells, in vitro or in vivo, with a 

15 proliferation stimulating or viability increasing effective amount of a FGF-22 polypeptide. The 
polypeptide of the present invention may be employed in combination with a suitable 
physiologically acceptable carrier to comprise a physiologically acceptable composition for 
administration. Such compositions comprise a therapeutically effective amount of the FGF-22 
polypeptide and a physiologically acceptablely acceptable carrier or excipient. Such a carrier 

20 includes but is not limited to saline, buffered saline, dextrose, water, glycerol, ethanol, and 

combinations thereof. The formulation should suit the mode of administration. Preferably the cells 
are located in tendons, ligaments, and synovial membranes. More specifically the cells would be 
fibroblasts present in loose, dense, collagenous and elastic connective tissues of the tendons and/or 
ligaments and/or synoviocytes within synovial membranes and contacted using the methods of 

25 Chan, et al. (Acta Orthop Scand 71: 513-518, 2000) and Kuroda, et al. (Knee Surg Sports 

Traumatol Arthrosc 8: 120-126, 2000), which disclosures are hereby incorporated by reference in 
their entirety. More preferably the fibroblasts would be induced to actively synthesize dense 
connective tissue and/or collagen. Preferably, the connective tissue is contacted by administering a 
FGF-22 polypeptide to an individual. Preferably, the FGF-22 polypeptide is administered 

30 parenterally, more preferably intraperitoneal. 

A further specified embodiment of the present invention is a method of promoting cartilage 
(hyaline cartilage, fibrocartilage, elastic cartilage) wound repair or tissue healing, in vitro and in 
vivo, such as resultant from aging, post-operative damage, disease, or other insult, by stimulating 
cartilage tissue cell proliferation or increasing cartilage tissue cell viability by contacting said cells, 

35 in vitro or in vivo, with a proliferation stimulating or viability increasing effective amount of a 
FGF-22 polypeptide. The polypeptide of the present invention may be employed in combination 
with a suitable physiologically acceptable carrier to comprise a physiologically acceptable 
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composition for administration. Such compositions comprise a therapeutically effective amount of 
the FGF-22 polypeptide and a physiologically acceptable carrier or excipient. Such a carrier 
includes but is not limited to saline, buffered saline, dextrose, water, glycerol, ethanol, and 
combinations thereof. The formulation should suit the mode of administration. Preferably the 
5 cells are located within joints and/or articular surfaces involved in interstitial/endogenous growth 
and/or appositional/exogenous growth, ends of long bones (articular cartilage), ends of ribs (costal 
cartilage), intervertebral disks, symphysis of pubis, menisci of knee, nasal septum, larynx, pharynx, 
trachea, bronchi, epiglottis, sternum, Eustachian tubes, and of the external (pinna), middle, and 
inner ear. More specifically the cells would be ground substance (collagenous or elastic fibers, 

10 glycosaminoglycans, chondroitin sulfate matrix) remodeling cells (chondrocytes,chondroblasts, 
chondroclasts) present in cartilagenous connective tissues and contacted using the methods of 
Toolan, et al. (J Biomed Mater Res 31: 273-280, 1996), Shida, et al. (J Orthop Res 14: 265-272, 
1996), and/or Chan, et al. (Clin Orthop 342: 239-247, 1997), which disclosures are hereby 
incorporated by reference in their entirety. More preferably the cartilage cells (chondrocytes, 

15 chondroblasts) would be induced to actively synthesize ground substance. Preferably, the 

connective tissue is contacted by administering a FGF-22 polypeptide to an individual. Preferably, 
the FGF-22 polypeptide is administered parenterally, preferably intraperitoneal, or by any 
convenient manner, or by syringe or catheter at the location of targeted cartilage connective tissue 
biosynthesis (Chan et al., Clin Orthop 342: 239-247, 1997). 

20 A further specified embodiment of the present invention is a method of promoting osseous 

(compact bone, spongy bone) wound repair or tissue healing, in vitro and in vivo, such as resultant 
from aging, post-operative damage, disease, or other insult, by stimulating osseous connective 
tissue cell (osteoblast progenitor stromal stem cell, osteocyte, osteoblast, osteoclast) proliferation 
or increasing osseus connective tissue cell viability by contacting said cells, in vitro or in vivo, 

25 with a proliferation stimulating or viability increasing effective amount of a FGF-22 polypeptide. 
The polypeptide of the present invention may be employed in combination with a suitable 
physiologically acceptable carrier to comprise a physiologically acceptable composition for 
administration. Such compositions comprise a therapeutically effective amount of the FGF-22 
polypeptide and a physiologically acceptable carrier or excipient. Such a carrier includes but is not 

30 limited to saline, buffered saline, dextrose, water, glycerol, ethanol, and combinations thereof. The 
formulation should suit the mode of administration. Preferably the cells (osteoblast progenitor 
stromal stem cell, osteocytes, osteoblast) would be induced to actively synthesize intestitial matrix 
substance containing mineral salts such as calcium phosphate and calcium carbonate as well as 
collagenous fibers. The osseous tissue cells would be contacted using the methods of Mathijssen, 

35 et al. (J Craniofac Genet Dev Biol 20: 127-136, 2000), Reiff, et al. (J Trauma 50: 433-438, 2001) 
and/or Mackenzie, et al. (Plast Reconstr Surg 107: 989-996, 2001). In response to FGF-22 
treatment, radiomorphometric (percentage of radiopacity of defect) and histomorphometric (square 
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millimeters of new bone formation) methods would be used to derive quantitative outcome data, 
bone formation). Preferably, the osseus connective tissue is contacted by administering a FGF-22 
polypeptide to an individual. Preferably, the FGF-22 polypeptide is administered parenterally, 
preferably intraperitoneal, or by any convenient manner, or by syringe or catheter at the location of 
5 targeted osteosynthesis (Radomsky, et al., Clin Orthop 355 Suppl: S283-S293, 1998), or by directed 
intraosseous injection using the methods of (Nakamura, et al., J Orthop Res 15: 307-313, 1996; 
Nakamura et al., Int Orthop 22: 49-54, 1998). 

FGF-22 is expressed in the ventrolateral thalamic nucleus of the CNS, a region associated 
with paralysis agitans, or Parkinson's Disease. Surgical intervention using thalamatomy for 

1Q Parkinson's disease involves introduction of lesions in the ventrolateral thalamus to relieve tremor 
and improve rigidity. Therefore, a further embodiment of this invention is a method of attenuating 
Parkinson's Disease associated tremors, or unrelated benign essential tremors, by contacting 
ventrolateral thalamic tissue comprising the steps of contacting said cells with an effective amount 
of a FGF-22 polypeptide. Another aspect of the present invention relates to a method for 

15 enhancing and/or stimulating and/or maintaining and/or regenerating the formation and/or survival 
of neurons in vitro or in the central nervous system of a mammal which comprises contacting 
neurons or neural progenitor cells, e.g., in vitro or by administering to said mammal, an effective 
amount of FGF-22 for a time and under conditions sufficient to effect an increase in and/or to 
maintain the number of neurons in the central nervous system. Prefereably the cells and/or tissue 

20 is located within the thalamic region of the CNS. More preferably the cells and/or tissue are of the 
thalamic ventral nuclei. The polypeptide of the present invention may be employed in combination 
with a suitable physiologically acceptable carrier to comprise a physiologically acceptable 
. composition for administration. Such compositions comprise a therapeutically effective amount of 
the FGF-22 polypeptide and a physiologically acceptablely acceptable carrier or excipient. Such a 

25 carrier includes but is not limited to saline, buffered saline, dextrose, water, glycerol, ethanol, and 
combinations thereof. The formulation should suit the mode of administration. Preferably, the 
CNS tissue is contacted by administering a FGF-22 polypeptide to an individual. Preferably, the 
FGF-22 polypeptide is administered parenterally, with the route of administration intraperitoneal, 
intramuscular, or by intravenous injection, or using gene therapy, although additional routes are 

30 possible such as infusion, drip, intracerebral injection (Mufson, et al., Prog Neurobiol 57: 45 1-484, 
1999) and/or implants (Shults, et al., Brain Res 883: 192-204, 2000; Tornqvist, et al., Exp Neurol 
164: 130-138, 2000) and as described in US 6,179,826, which disclosures are hereby incorporated 
by reference in their entireties. FGF-22 may also be administered directly to the brain. In an 
additional embodiment of this invention, FGF-22 may also be employed to stimulate neuronal 

35 growth and to treat and prevent neuronal damage associated with stroke or which occurs in certain 
neuronal disorders or neurodegenerative conditions such as Alzheimer's and AIDS-related 
complex. 
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The Adeno Associated Virus (AAV) utilizes the human FGFR-1 as a co-receptor for 
infection in mammalian cells (Qing, et al, Nat Med 5: 71-77, 1999, which disclosures are herehy 
incorporated by reference in their entirety) as well as the ubiquitously expressed heparan sulfate 
proteoglycans on cell surfaces. Similarly, adenoviral vectors are effectively targeted for the 
5 treatment of systemic and local disease using the ability of FGF family polypeptides to bind their 
cognate FGFR's with high affinity (Sosnowski, et al, Curr Opin Mol Ther 1: 573-579, 1999, 
which disclosures are hereby incorporated by reference in their entirety). As a further embodiment 
of this invention is a method of retargeting a FGF-22 polypeptide or chimeric polypeptide encoded 
as part of an adenoviral or AAV delivery system to cells expressing cognate FGFR complexes 

10 using the methods of Hoganson, et al, (Mol Ther 3: 105-1 12, 2001) and Qing, et al (Nat Med 5: 
71-77, 1999), which disclosures are hereby incorporated by reference in their entirety. Preferably 
the FGF-22 polypeptide is expressed, in part or in whole, with the viral delivery system as a 
Afunctional conjugate consisting of a blocking anti-adenoviral knob Fab fragment linked to FGF- 
22 using the methods of Goldman, et al (Cancer Res 57:1447-51, 1997) and Doukas, et al 

15 (FASEB J 13:1459-66, 1999). Preferably the FGFR complex is the FGFR-1 polypeptide or FGFR- 
1 polypeptide ligand binding moiety. 

Protein of SEQ ID NO:18 (Internal designation Clone 229633_253-2-5-2-All-F) 

The cDNA of Clone 229633^25 3-2-5 -2-A1 1-F (SEQ ID NO: 17) encodes the STAM- 
SAPper (STAMSAP) protein comprising the amino acid sequence: 
20 MDRALQVLQSroFTOSKPDSQDLLDL 

LELYNKLVNEAPWSVYSKLHPPAHYPPASSGWM 
SYSLGPDQIGPLRSLPPNVNSSVTAQPAQ 

TQQMGMSVDMSSYQNTTSNLPQLA (SEQ ID 

NO: 1 8). Accordingly, it will be appreciated that all characteristics and uses of the polypeptides of 

25 SEQ ID NO: 18 described throughout the present application also pertain to the polypeptides 
encoded by the nucleic acids included in Clone 229633_253-2-5-2-Al 1-F. In addition, it will be 
appreciated that all characteristics and uses of the polynucleotides of SEQ ID NO: 17 described 
throughout the present application also pertain to the nucleic acids included in Clone 229633_253- 
2-5-2-A1 1-F. A preferred embodiment of the invention is directed toward the compositions 

30 comprising SEQ ID NO:17, SEQ ID NO:18, and Clone 229633_253-2-5-2-Al 1-F. Another 

preferred embodiment of the invention is directed toward compositions comprising polynucleotide 
fragments of at least eighteen contiguous nucleotides selected from: 

gagcaagacgtggtgatgccaattggtggaaaggagaaaatcac, preferably those polynucleotides that encode for 
polypeptides having a biological activity described herein. Further preferred polynucleotides of 
35 the present invention include nucleic acids comprising: 

gaagcggmgsggtctagggagccgcggccgcgggtcacccggcgggtagcagttgctgagtgtcagctagacagcagcgactagggct 
cgggcgccggcgagatgcctttgttcaccgccaacccctte^ 
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preferably those that encode for polypeptides having a biological activity described herein. Further 
preferred polynucleotides of the present invention include nucleic acids of SEQ ID NO: 17 
comprising 

gaagcggmgsggtctagggagccgcggccgcgggtcacccggcgggtagcagttgctgagtgtcagctagacagcagcgactagggct 

5 cgggcgccggcgagatgcctttgttcaccgccaaccccttcgagcaagacgtggtgatgccaattggtggaaaggagaaaatcacagagga 
ataggactmcccatccaatmgtaacaactaatttaaacatagagactgaggcagcggctgtggacaaattgaatg^ 
aggaaattaagaaatcagagcctgagcctgtttatatagatgaggataagatggatagagccctgcaggtacttcagagtatagatccaacaga 
ttcaaaaccagactcccaagaccttttggatttagaagatatctgccaaca preferably those that encode for polypeptides 
of having a biological activity described herein. Polypeptides of the invention having a biological 

10 activity of x%, where x is any integer between 1 and 100 of those described herein and 
polynucleotides encoding the same are also included in the invention. Polypeptides of the 
invention with biological activity are defined as polypeptides that can be phosphorylated by a 
tyrosine kinase such as a Janus kinase (Jak). 

STAMS AP protein results from a splice event within the Signal Transducing Adaptor 

15 Molecule (STAM)-2 transcript. This splice variant contacts or recombines nucleotide 152 of 
STAM-2 with nucleotide 817. The resulting STAMS AP splice variant encodes the carboxy- 
tenninal 228 amino acids of the 525-amino acid STAM-2 protein. STAM-2 contains three well- 
characterized domains. The first is an SIB domain spanning amino acids 212-266 that is not 
shared with STAMSAP. This SH3 domain binds the downstream effector of STAM-2, AMSH, 

20 which activates proto-oncogenic transcription factors comprising c-myc and AP-1, and results in 
responses that include cell proliferation (Tanaka, N., et al. (1999) J. Biol. Chem. 274:19129-35 
which disclosure is hereby incorporated by reference in its entirety). An Lnmunoreceptor 
Tyrosine-based Activation Motif (ITAM) spanning amino acids 359-387 of STAM-2 and a 
carboxy-terminal tyrosine-rich domain are shared with STAMSAP (Endo, K., et al. (2000) FEBS 

25 Let. 477:55-61 and Pandey, A., et al. (2000) J. Biol. Chem. 275:38633-9 which disclosures are 
hereby incorporated by reference in their entireties). 

STAMSAP is phosphorylated on tyrosine residues within the ITAM and carboxy-terminal 
domains by Jak molecules comprising Jak2 and Jak3. Jak2 and Jak3 phosphorylate STAMSAP in 
response to ligand binding of cell surface receptors comprising IL-2R, IL-3R, IL-4R, IL-7R, 

30 Platelet Derived Growth Factor Receptor (PDGFR), Epidermal Growth Factor Receptor (EGFR), 
and Granulocyte Macrophage Colony Stimulating Factor Receptor (GM-CSFR). Jak activation 
and subsequent gene expression is associated with proliferation and cancers comprising breast and 
colon carcinomas and B ceil lymphomas (Yamauchi, T., et al. (2000) J. Biol. Chem. 275:33937-44; 
Kaulsay, K., (2000) Endocrinology 141:1571-84; U.S. Patent 6177433 which disclosures are 

35 hereby incorporated by reference in their entireties). Jak is often hyperactivated due to abnormally 
high expression of upstream receptors or their ligands in cancer cells. For example, higher than 
normal levels of PDGF are indicative of advanced stages of breast cancer (Seymour, L., et al. 
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(1993) Breast Cancer Res. Treat. 26:247-52 which disclosure is hereby incorporated by reference 
in its entirety). EGFR is overexpressed in a variety of tumors including cervical cancer (Mathur, 
R., et al. (2000) Am. J. Reprod. Immunol. 44:1 14-20 which disclosure is hereby incorporated by 
reference in its entirety). Furthermore, Jak3 is activated in stimulated mast cells, causing 
5 degranulation and subsequent allergic reactions (U.S. Patent 6177433 which disclosure is hereby 
incorporated by reference in its entirety). 

STAMSAP does not have a downstream effector and therefore acts as a dominant negative 
inhibitor of Jak signaling. In a preferred embodiment of the invention, the STAMSAP polypeptide 
is used to inhibit cell proliferation, cell survival, or viral replication downstream of Jak signaling. 

10 This embodiment is accomplished by methods comprising the step of delivering STAMSAP to 
cells responsive to activated Jak, for example, MOLT-4 cells expressing IL-2R (ATCC number 
CRL-1582). Methods for delivering STAMSAP to Jak-resposive cells include contacting said cells 
with STAMSAP polynucleotides or polypeptides by methods common to the art as discussed in the 
following paragraph. Further included in this embodiment is a polynucleotide comprising 

1 5 polynucleotides encoding a STAMSAP polypeptide with biological activity operably linked to an 
expression control element such as a promotor. Said polynucleotide is delivered to Jak-responsive 
cells by methods common to the art such as electroporation or transfection of naked 
polynucleotides. In addition, genes activated by Jak signaling may be monitored or assayed using 
methods common to the art, for example, reporter gene assays such as luciferase or beta- 

20 galactosidase. This embodiment is applied to, for example, inhibiting Jak-dependent cell responses 
in vitro. 

Another preferred embodiment of the invention is directed towards methods to use 
STAMSAP to inhibit Jak-induced cell proliferation. In particular, this embodiment is directed 
toward inhibiting proliferation of cells resulting from activation of any upstream effector of Jak, 

25 such as a growth factor. Preferred upstream effector molecules include but are not limited to: 
PDGFR, EGFR, IL-2R, EL-3R, IL-4R, IL-7R, and GM-CSFR. STAMSAP is used in this method 
comprising the step of introducing a STAMSAP polypeptide or a polynucleotide comprising 
polynucleotides encoding said polypeptide operably linked to an expression control element into 
cells activated by Jak or any upstream effector of Jak (e.g., cervical cancer cells stimulated with 

30 EGF). Preferred control elements express an amount of STAMSAP effective to inhibit 

proliferation of cells to which the invention is delivered. Alternative preferred control elements 
comprise cell- or tissue-specific enhancer elements, for example, the lyn enhancer for B cells, or c- 
myc or AP-1 sites for proliferating cells. Said polypeptides or polynucleotides are introduced into 
said cells using methods common to the art, including but not limited to lipid vesicles or viral 

35 transduction, as described in any one of the list: U.S. Patent 5616565, U.S. Patent 61 10490, U.S. 
Patent 6204060, or WO9704748 which disclosures are hereby incorporated by reference in their 
entireties. For example, polynucleotides are delivered to said cells by: i) compressing a 
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polynucleotide expression unit, preferably an expression unit containing polynucleotides encoding 
biologically active STAMSAP polypeptide, into a lipid vesicle derived from any of the following 
list: viral envelopes, liposomes, micelles, and modified versions of these, as described in U.S. 
Patent 61 10490 or P.C.T.904748, which disclosures are hereby incorporated by reference in their 
5 entireties; ii) optionally targeting the lipid vesicle to specific cells, for example, by embedding a 
member of a receptor-receptor ligand pair into the lipid envelope (e.g., CD40 ligand for targeting 
to B cells); iii) contacting the targeted vesicle with specific cells by methods common to the art 
such as injection or inhalant (U.S. Patent 61 10490, P.C.T 9704748, and U.S. Patent 6034062 
which disclosures are hereby incorporated by reference in their entireties). An example of 

10 delivering polypeptides to said cells comprises the steps: i) packaging a biologically active 
STAMSAP polypeptide into a lipid vesicle; ii) targeting the lipid vesicle to specific cells, for 
example, by including a member of a receptor- receptor ligand pair in the lipid envelope; 
iii) embedding a fusogenic component such as a peptide in the lipid envelope to promote delivery 
of encapsulated polypeptides to target cells; and iv) contacting the targeted vesicle with specific 

1 5 cells by injection or inhalant (P.C.T. 9704748 and U.S. Patent 6034062 which disclosures are 
hereby incorporated by reference in their entireties). 

In another preferred embodiment, STAMSAP is used to inhibit Jak3 in cells that induce an 
inflammatory response, such as mast cells, eosinophils, T cells, and B cells. This embodiment 
includes a method to deliver a biologically active STAMSAP polypeptide or a polynucleotide 

20 comprising polynucleotides encoding said polypeptide operably linked to an expression control 
element to individuals displaying the effects of an inflammatory response (e.g., allergic rhinitis 
(hay fever), allergic urticaria (hives), angioedema, allergic asthma, or anaphylaxis). Preferred 
methods of delivery include but are not limited to a method comprising the steps: i) packaging of 
said polynucleotide into a lipid vesicle as described in U.S. Patent 61 10490, U.S. Patent 5616565, 

25 and P.C.T. 9704748 which disclosures are hereby incorporated by reference in their entireties, and 
ii) delivering the vesicle to cells that induce an allergic response, such as mast cells, so that 
STAMSAP polypeptide is contacted with the relevant intracellular site. Preferred control elements 
direct expression of an amount of STAMSAP effective to inhibit an inflammatory response. 
Further preferred control elements for use in this embodiment include promoters of cell-specific 

30 genes such as CD48 in mast cells. The lipid vesicle is derived from any of the following list: viral 
envelopes, liposomes, micelles, and modified versions of these. Targeting of vesicles to specific 
cell types, as referred to in step (ii), is effected by embedding a targeting moiety such as a member 
of a receptor- receptor ligand pair into the lipid envelope of the vesicle. Useful targeting moieties 
specifically bind cell surface ligands, such as CD48 or the SCF receptor on mast cells. Thus, anti- 

35 CD48 antibodies or SCF ligand are examples of useful mast cell-targeting moieties. In addition, 
the antibodies B43 and TXU are useful for B and T cells, respectively. Vectors and targeting are 
further described in U.S. Patent 6177433, U.S. Patent 61 10490, and P.C.T. 9704748, which 
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disclosures are hereby incorporated by reference in their entireties. The invention is delivered to 
the appropriate site by methods common to the art such as injection or inhalant as described in U.S. 
Patent 6177433 and U.S. Patent 6034062, which disclosures are hereby incorporated by reference 
in their entireties. 

5 Protein of SEQ ID NO:22 (Internal designation Clone 589198_184-1 1-1-0-E4-F) 

The cDNA of Clone 589198 J 84-1 1-1-0-E4-F (SEQ ID NO:21) encodes the Corneal 
Osteo-Vascular Inducing (COVI) protein comprising the amino acid sequence: 
MKTLQSTLLLLLLWLIKPAPPTQQDSREYDYGTO 
VDPNEKSLQLQKDEAITPLPPKK^ 
10 YARFNKIKKLTAKI^ 
KLTLFNAKYNKIK^ 

TDDTFCKANDTSYIRD (SEQ ID NO:22). 

Accordingly, it will be appreciated that all characteristics and uses of the polypeptides of SEQ ID 

NO:22 described throughout the present application also pertain to the polypeptides encoded by the 
15 nucleic acids included in Clone 589198_184-1 1-1-0-E4-F. In addition, it will be appreciated that 

all characteristics and uses of the polynucleotides of SEQ ID NO:21 described throughout the 

present application also pertain to the nucleic acids included in Clone 589198_184-1 1-1-0-E4-F. 

A preferred embodiment of the invention is directed toward the compositions of SEQ ID NO:21, 

SEQ ID NO:22, and Clone 589198_184-1 1-1-0-E4-R Further included in the invention are 
20 polypeptide fragments at least seven amino acids in length of SEQ ID NO:22 and those having a 

biological activity of those described herein and polynucleotides encoding the same. Biological 

activities include but are not limited to increasing bone density when contacted with osteogenic 

cells and remodeling of vascular tissue. 

The COVI polypeptide is a unique splice variant of the mimecan (also called osteoglycin 
25 and osteoinductive factor) gene (Kukita, A, et al. (1990) Proc. Natl. Acad. Sci. 87:3023-6, 

Funderburgh, J., et al. (1997) J. Biol. Chem. 272:28089-95, and Tasheva, E., et al. (1999) J. Biol. 

Chem. 274:18693-701 which disclosures are hereby incorporated by reference in their entireties). 

The 1997 base pair COVE transcript begins in exon 3 of full-length mimecan and encodes a 298 

amino acid protein. 

30 The COVI polypeptide is a secreted protein associated with the extracellular matrix (ECM) 

that promotes growth and remodeling of bone. In a preferred embodiment of the invention, COVI 
polypeptide is used in a method to promote bone growth by contacting a bone growth-stimulating 
effective amount of COVI polypeptide with cells. Preferred cells are those that normally produce 
bone tissue, including but not limited to osteoblasts, osteocytes, and their precursors. This method 

35 is useful to facilitate bone growth in cases including but not limited to bone loss, atrophy, or 
malformation due to injury, congenital or chronic conditions, surgery, or disease. Examples 
include but are not limited to osteopenia, osteoporosis, rickets, malignant melanoma-induced bone 
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degradation, and bone fissures or fractures due to injury, elective surgery (e.g., plastic surgery), 
reconstructive surgery, and dental procedures or surgeries. CO VI polypeptides are delivered in a 
physiologically acceptable solution, for example, pH-buffered saline, viscous solutions such as 
those including glycerol or dextrose, or in solutions that include other components to support bone 
5 growth. Preferred bone growth components comprise bone fragments, ground bone, and matrix 
materials including calcium sulfate, hydroxyapatite, ultrahigh molecular weight polyethylene 
(UHMWPE), and proteins such as collagen. COVI polypeptides in physiologically acceptable 
solution are delivered locally or systemically (as the case dictates) by methods including but not 
limited to injection, catheter delivery, or direct implantation (U.S. Patent 6,034,062 which 

10 disclosures are hereby incorporated by reference in their entirety). 

A further embodiment of the invention is a method of contacting a bone growth- 
stimulating amount of COVI polypeptide with cells to facilitate integration of bone, for example, 
for purposes of bone transplantation in cases of dental implants, orthopedic prosthesis, or other 
surgical procedures. Preferred cells are those present in bone tissue, including but not limited to 

15 osteoblasts, osteocytes, and their precursors. COVI polypeptides are delivered in a physiologically 
acceptable solution, for example, pH-buffered saline, viscous solutions such as those including 
glycerol or dextrose, or in solutions that include other components to support bone growth. 
Preferred bone growth components comprise bone fragments, ground bone, and matrix materials 
including calcium sulfate, hydroxyapatite, UHMWPE, and proteins such as collagen. COVI 

20 polypeptides in physiologically acceptable solution are delivered to the site of desired bone 

integration by methods comprising injection or direct addition to the integrated tissue (U.S. Patent 
6034062, which disclosure is hereby incorporated by reference in its entirety). 

A further embodiment of this invention is a method of contacting a growth-stimulating 
amount of COVI polypeptide with cells to facilitate bone growth for example, for purposes of 

25 transplantation. Preferred cells include bone cells. Further preferred cells include but are not 
limited to human osteoblast cells, for example the cell lines MG63 or C2C12 or osteoblasts 
purified directly from bone, or their progenitors, such as those purified from bone marrow stroma 
or mesenchymal stem cells. Preferred culture conditions are common to the art and can include but 
are not limited to other factors to promote bone formation, for example bone or composite matrices 

30 to direct shaping, ascorbic acid, beta-glycerophosphate, dexamethasone, calcium salts, and 
collagen [Dean, D., et al. (2001) J. Orthop. Res. 19:179-86 and Buttery, L., et al. (2001) Tissue 
Eng. 7:89-99, which disclosures are hereby incorporated by reference in their entireties]. A 
preferred method comprises the steps: contacting COVI polypeptide directly with cells in culture; 
harvesting mineralized bone formation; and surgically implanting newly formed bone into desired 

35 location (U.S. Patent 4950296, U.S. Patent 5385566, and U.S. Patent 6200324, which disclosures 
are hereby incorporated by reference in their entireties). Another preferred method comprises the 
steps: delivering polynucleotides to cells in culture; delivering cells to sites of desired bone growth 
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(for example, to the site of a fracture or to an osteopenic bone). Preferred polynucleotides 
comprise polynucleotides encoding CO VI polypeptide operably linked to an expression control 
unit (e.g., a promoter) that will deliver a bone growth-stimulating amount of CO VI expression (for 
example, high, constitutive expression from the CMV promoter or regulated expression from a 

5 tetracycline-repressible promoter, both of which are readily commercially available). Said 
polynucleotides are delivered to cells in vitro or in situ by methods common to the art such as 
electroporation, calcium phosphate transfection, or adenoviral transduction [Maniatis, T., et al. 
Molecular Cloning A Laboratory Manual, Cold Spring Harbor Laboratory (1982) and Cheng, S., et 
al. (2001) Calcif. Tissue Int. 68:87-94, which disclosures are hereby incorporated by reference in 

10 their entireties]. Cells are introduced to a site of desired bone growth in vitro, in situ, or in vivo by 
methods comprising injection, introduction through a catheter, or surgical implantation of a cell- 
containing stent, for example, on an osteopenic bone (U.S. Patent 6034062 and U.S. Patent 
6206914, which disclosures are hereby incorporated by reference in their entireties). 

COVI is associated with vascular smooth muscle cells (VSMC) in the ECM. The COVI 

15 splice variant has enhanced ability to promote vascular matrix remodeling, i.e., formation of new 
vessels (e.g., during development or tissue expansion), and healing of damaged vessels such as 
those resulting from injury, incision, burns, disease, cardiac infarction, ulcers, diabetic ulcers, and 
chronic conditions such as atherosclerosis. A preferred embodiment of the invention is a method 
to promote vascular remodeling by contacting a vascular remodeling-stimulating amount of COVI 

20 polypeptide with cells. Preferred cells include but are not limited to VSMC, vascular epithelial 
cells, and fibroblasts. Further preferred cells include but are not limited to human VSMC, vascular 
epithelial cells, and fibroblasts in intact tissue (i.e., in a milieu of ECM proteins such as collagen). 
COVI polypeptides are delivered to cells in physiologically acceptable solution, for example, pH- 
buffered saline or viscous solutions such as those including glycerol or dextrose. Said solution 

25 may be applied topically to surface wound tissue in the treatment of ulcers, lesions, injuries, 
diabetic ulcers, burns, trauma, stasis ulcers, periodontal conditions, lacerations, and other 
conditions. In addition, intraperitoneal wound tissue such as that resulting from invasive surgery 
may be treated with a physiologically acceptable solution comprising COVI polypeptides to 
accelerate vascular remodeling. For example, the surgical plane may be coated with said solution 

30 prior to closing the surgical site to facilitate internal capillary perfusion and healing. In addition, . 
the rate of localized healing may be increased by the subdermal administration of said solution by 
methods common to the art such as injection (U.S. Patent 6,096,709, which disclosure is hereby 
incorporated by reference in its entirety). 

Timely vascular remodeling is an urgent factor in the case of cardiac infarction to prevent 

35 enlargement of the organ. A further preferred embodiment of the invention is a method of 

contacting a vascular remodeling-stimulating amount of COVI polypeptide with cells. The method 
comprises the step of contacting COVI polypeptides with cells by implantation of a COVI 
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polypeptide- releasing stent, for example surgically or via catheter (U.S. Patent 5,500,013 and U.S. 
Patent 5,449,382, which disclosures are hereby incorporated by reference in their entireties). 
Preferred cells include but are not limited to those found in cardiac tissue damaged as a result of 
infarction or within vessels for treating various problems such as atherosclerosis, stenonses, 
5 strictures, or aneurysms to reinforce collapsing, partially occluded, or weakened sections. 

A further preferred embodiment of the invention is a method to promote vascular 
remodeling by delivering polynucleotides encoding CO VI polypeptides to cells. This method is 
directed toward purposes such as transplantation of cells expressing COVI polypeptides. Preferred 
cells include but are not limited to VSMC, vascular epithelial cells, and fibroblasts. Further 

10 preferred cells include but are not limited to human VSMC, vascular epithelial cells, and 
fibroblasts, preferably in intact tissue (i.e., in a milieu of ECM proteins such as collagen). 
Preferred polynucleotides comprise polynucleotides encoding COVI polypeptides operably linked 
to an expression control unit (e.g., a promoter) that will deliver a vascular remodeling-stimulating 
amount of COVI expression (for example, high, constitutive expression from the CMV promoter 

15 . or regulated expression from a tetracycline-repressible promoter, both of which are readily 

commercially available). Said polynucleotides are delivered to cells in vitro or in situ by methods 
common to the art such as electroporation, calcium phosphate transfection, or adenoviral 
transduction [Maniatis, T., et al., Molecular Cloning A Laboratory Manual, Cold Spring Harbor 
Laboratory (1982) and Cheng, S., et al. (2001) Calcif. Tissue Int. 68:87-94, which disclosures are 

20 hereby incorporated by reference in their entireties]. Further included in the method is a step of 
delivering said cells to a desired site of vascular remodeling (including but not limited to wounds, 
incisions, injuries, ulcers, and diseased or otherwise hypovascular lesions) by methods common to 
the art such as injection or catheter delivery of cell suspensions or surgical implantation of intact 
tissue endoscopically or invasively (U.S. Patent 5,669,925 and U.S. Patent 5,683,345, which 

25 disclosures are hereby incorporated by reference in their entireties). 

COVI polypeptide is also present as a highly modified keratan sulfate proteoglycan (KSPG) in the 
cornea. KSPG's are associated with ECM proteins in the cornea and function to maintain corneal 
shape and opacity. In a further embodiment of the invention, a cornea-maintaining effective 
amount of COVI polypeptide is used in a method for maintaining a desired shape (e.g., following 

30 laser surgery or non-invasive orthokeratological procedures) or opacity of corneal tissues (e.g., at 
the onset of cataract formation). This method comprises the step of contacting COVI polypeptides 
with the ECM of the cornea in a physiologically acceptable solution. A preferred physiologically 
acceptably solution includes pH-buffered saline. Preferred method of contact is by an eye-drop 
mechanism (P.C.T. 001 19386, which disclosure is hereby incorporated by reference in its entirety). 

35 Protein of SEQ ID NO:4 (Internal designation Clone 1000848582_181-40-4-0-All-F) 

The cDNA of clone 1000848582J81-40-4-0-A11-F (SEQ ID NO:3) encodes the protein 
of SEQ ID NO:4 comprising the amino acid sequence 
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MELALRRSPWRWLLLLPLLL 

ATNSCKNFSELPLVMWLQGGPGGSSTGFGNFEEIGPLDSDLKPRKTTWLQ 
VGTGFSYVNGSGAYAKDLAMV ASDMMVLLKTFFSCHKEFQTWFY1FSES YGGKMAAGI 
GLELYKAIQRGTIKCNFAGVALGDSWISPVDSVL^WGPYLY 
5 VLNAVNKGLYREATELW 

Accordingly it will be appreciated that all characteristics and uses of polypeptides of SEQ ID NO:4 
described throughout the present application also pertain to the polypeptides encoded by the 
nucleic acids included in Clone 1000848582_181-40-4-0-Al 1-F. In addition, it will be appreciated 
that all characteristics and uses of the polynucleotides of SEQ ID NO:3 described throughout the 
10 present application also pertain to the nucleic acids included in Clone 1 0008485 82_1 8 1-40-4-0- 
Al 1-F. A preferred embodiment of the invention is directed toward the compositions of SEQ ID 
NO:3, SEQ ID NO:4, and Clone 1000848582_181-40^-0-Al 1-F. Also preferred are polypeptide 
fragments having a biological activity as described herein and the polynucleotides encoding the 
fragments. 

15 The protein of SEQ ID NO:4 encodes a novel serine carboxypeptidase designated here 

serine carboxypeptidase hx (SCPhx). SCPhx has a unique C-terminal sequence of 3 1 amino acids 
comprising KRGNTQRLACL AFSGG YRAHG WCLQTWSLH . This unique sequence within 
SCPhx contributes the histidine of the catalytic triad. SCPhx cleaves the peptide bond between the 
penultimate and C-terminal amino acid residues of its protein or peptide substrate and, in so doing, 
20 can either activate or inactivate the biological function of the substrate. 

A preferred embodiment of the invention is directed to compositions comprising the amino 
acid sequence of SEQ ID NO:4 (SCPhx) or fragments thereof. 

Further preferred is a method to use the serine carboxypeptidase activity of compositions 
comprising SCPhx polypeptide for biosynthetic procedures. Further preferred is an application of 
25 said method wherein a recombinant polypeptide engineered with a protective but inactivating C- 
terminal amino acid is activated through removal of this amino acid by SCPhx. 

Further preferred is a method to use the serine carboxypeptidase activity of compositions 
comprising SCPhx polypeptide for analytical procedures. Further preferred is an application of 
said method wherein the requirement for the C-terminal amino acid for the function of a given 
30 protein is determined through removal of the amino acid by SCPhx. 

The serine carboxypeptidase activity of SCPhx confers on SCPhx antifibrinolytic activity. 
In a further embodiment, compositions of the invention comprised of SCPhx are used in methods 
wherein the antifibrinolytic activity of SCPhx is used to promote wound healing. In further 
preferred embodiment, the composition is used in methods of stabilizing blood clots at sites where 
35 there is a breach in the vasculature by contacting a wound or injured tissue with a regenerative- 
effective amount of compositions of the invention. 

In a further embodiment of the invention, SCPhx is used in a method for antibody-directed 
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enzyme prodrug therapy (ADEPT). In said method, in vivo localization of SCPhx serine 
carboxypeptidase activity is effected through conjugation of SCPhx to specific antibody. Injection 
of SCPhx-antibody conjugate in conjunction with prodrug (drug-alpha-peptide) (Shi, P.T., et al., 
Yao Xue Bao 32: 106-9 (1997) which disclosure is hereby incorporated by reference in its entirety) 

5 results in localized activation of the drug. 

In said method for ADEPT, a preferred embodiment of the invention is directed to 
compositions comprising SCPhx conjugated to tumor-reactive antibody [Napier, M.P., et al., Clin. 
Cancer Res. 6:765-72 (2000) which disclosure is hereby incorporated by reference in its entirety]. 
In further preferred embodiment, SCPhx is conjugated to antibody reactive with carcinoembryonic 

10 antigen (CEA) and is used in conjunction with methotrexate prodrug for the treatment of colorectal 
' carcinoma. 

In a further preferred embodiment, the present invention provides for an antibody that 
binds SCPhx with or without neutralization of SCPhx serine carboxypeptidase activity. The 
antibody may be monoclonal or polyclonal. Preferred compositions comprise the SCPhx antibody. 

15 SCPhx serine carboxypeptidase activity expressed by breast cancer cells can activate 

autocrine neuropeptide growth factors concomitantly expressed by the tumor cells. In further 
embodiment of the invention, neutralizing anti-SCPhx antibody is used by intravenous injection to 
. suppress tumor growth by blocking the activation of autocrine growth factors by SCPhx 
constitutively expressed by the tumor. In further preferred embodiment, said method is used for 

20 the treatment of breast cancer. Li further preferred embodiment, said method is used for the 
treatment of cancer of the salivary gland. 

SCPhx serine carboxypeptidase activity can process beta-amyloid precursor protein and 
generate beta-amyloid. In further embodiment of the invention, neutralizing anti-SCPhx antibody 
is used by injection in Alzheimer's disease to block processing of beta-amyloid precursor protein 

25 and generation of beta-amyloid. 

Daily administration of a very low dose of the polypeptide gAcrp30 to mice consuming a 
high-fat/sucrose diet causes profound and sustainable weight reduction without affecting food 
intake (Fruebis, J., et al., Proc. Natl. Acad. Sci. USA 98:2005-10 (2001) which disclosure is hereby 
incorporated by reference in its entirety). Said activity of gAcrp30 is abrogated by SCPhx serine 

30 carboxypeptidase activity. In a preferred embodiment of the invention, compositions comprising 
said neutralizing SCPhx antibody are used in methods to block in vivo inactivation of polypeptide 
function by SCPhx serine carboxypeptidase activity. In further preferred embodiment, 
compositions comprising said neutralizing SCPhx antibody are used in methods to treat obesity in 
humans by intravenous injection concomitant with human gAcrp30. In further preferred 

35 embodiment, compositions comprising said neutralizing SCPhx antibody are used in methods to 
treat obesity in other mammals by intravenous injection concomitant with mammal or human 
gAcrp30. 
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The invention further relates to a method of screening for test compounds that bind and/or 
inhibit SCPhx serine carboxypeptidase activity above comprising the steps of contacting an SCPhx 
polypeptide with said test compound and detecting or measuring whether said test compound binds 
said SCPhx polypeptide. Alternatively, the method comprises the steps of contacting an SCPhx 
5 polypeptide with substrate of said SCPhx polypeptide in the presence of test compound and 
detecting or measuring the release of the C-terminal amino acid from said SCPhx substrate, 
wherein a difference in the amount of said release relative to the amount of release in the absence 
of the test compound modulates, preferably inhibits, the serine carboxypeptidase activity of 
SCPhx. 

10 Protein of SEQ ID NO:8 (Internal designation Clone 1000770704_208-27-3-0-G6-F) 

The cDNA of clone 1000770704_208-27-3-0-G6-F (SEQ ID NO:7) encodes the protein of ■ 
SEQ ID NO: 8 comprising the amino acid sequence 
MRLPAQLLGLLMLWVSGSS 

YHQKPGQSPQLLIYLGSNRASGWDRFSGSGSGTOFTL 
15 TFGPGTRVDKRWAAPSVF^ 

GNSQESVTEQDSKDSTYSLSSTLTLSK 

Accordingly it will be appreciated that all characteristics and uses of polypeptides of SEQ ID NO: 8 
described throughout the present application also pertain to the polypeptides encoded by the 
. nucleic acids included in Clone 1000770704_208-27-3-0-G6-F. In addition, it will be appreciated 

20 that all characteristics and uses of the polynucleotides of SEQ ID NO:7 described throughout the 
present application also pertain to the nucleic acids included in Clone 1000770704_208-27-3-0- 
G6-F. A preferred embodiment of the invention is directed toward the compositions of SEQ ID 
NO:7, SEQ ID NO:8, and Clone 1000770704_208-27-3-0-G6-F. Also preferred are polypeptide 
fragments having a biological activity as described herein and the polynucleotides encoding the 

25 fragments. 

The protein of SEQ ID NO: 8 encodes the polypeptide CalX, which binds parathyroid 
hormone related protein (PTHrP), a hormone involved in bone metabolism. 

PTHrP was initially discovered as a tumor-derived systemic factor that causes humoral 
hypercalcemia of malignancy (HHM). PTHrP is now known to play a major role in HHM. It has 

30 been identified as the major causative agent in tumors that were previously thought to cause 

hypercalcemia through skeletal metastatic involvement. Hypercalcemia is the most common life- 
threatening metabolic disorder associated with neoplastic diseases, occurring in an estimated 10% 
to 20% of all persons with cancer. That PTHrP is not just a bystander but is the cause of the 
hypercalcemia is indicated by the observation that in animals with hypercalcemia caused by 

35 xenografts of human tumors, the infusion of neutralizing antibodies to PTHrP reverses the 
hypercalcemia. 

CalX binds to and neutralizes the activity of PTHrp, including the induction of HHM. 
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A preferred embodiment of the invention is directed to comprising the amino acid sequence of 
SEQ ID NO: 8 (CalX). Further included in the invention are fragments of full-length CalX having 
a biological activity described herein as well as the polynucleotides encoding these fragments. 

In a preferred embodiment, compositions of the invention are used in methods to neutralize 
5 PTHrP, wherein compositions comprising CalX are contacted with and thereby block PTHrP 
activity. A further embodiment is directed toward a method to use compositions of CalX to 
suppress HHM. In further preferred embodiment, CalX is used to suppress HHM associated with 
breast cancer, pancreatic adenocarcinoma, prostate cancer, squamous cell carcinoma of lung, renal 
cell carcinoma, ovarian carcinoma, and T cell leukemia/lymphoma. 

10 It is believed that PTHrP plays a role in the pathophysiology associated with osteoarthritis. 

In further preferred embodiment, CalX is used in a method to suppress bone resorption within an 
affected joint, preferably in the synovium of a joint capsule. Said methods comprise contacting 
CalX compositions with the synovial fluid of the joint capsule. Preferred delivery of CalX 
includes injection or transdermal contact at the site of the joint. 

15 It is believed that PTHrP plays a role in the pathophysiology associated with rheumatoid 

arthritis. In further preferred embodiment, CalX is used in a method to decrease inflammation 
within an affected joint, preferably in the synovium of a joint capsule. In further preferred 
embodiment, CalX is used in a method to decrease bone resorption within an affected joint, 
preferably in the synovium of a joint capsule. Said methods comprise contacting CalX 

20 compositions with the synovial fluid of the joint capsule. Preferred delivery of CalX includes 
injection or transdermal contact at the site of the joint. 

Protein of SEQ ID NO:6 (Internal designation Clone 1000839315_220-26-l-0-F3-F) 

The cDNA of clone 100083 93 15_220-26-l-0-F3-F (SEQ ID NO:5) encodes the protein of 
SEQ ID NO:6 comprising the amino acid sequence: 

25 MKFFVTALVLALMSMK 

NLLYTLCFRELAFSIVT. Accordingly it will be appreciated that all characteristics and uses of 
polypeptides of SEQ DO NO:6 described throughout the present application also pertain to the 
polypeptides encoded by the nucleic acids included in Clone 1000839315_220-26-l-0-F3-F. In 
addition, it will be appreciated that all characteristics and uses of the polynucleotides of SEQ ID 

30 NO:5 described throughout the present application also pertain to the nucleic acids included in 
Clone 1000839315_220-26-l-0-F3-F. A preferred embodiment of the invention is directed toward 
the compositions of SEQ ID NO:5, SEQ ID NO:6, and Clone 10008393 15_220-26-l-0-F3-F. Also 
preferred are polypeptide fragments having a biological activity as described herein and the 
polynucleotides encoding the fragments. 

35 The protein of SEQ ID NO:6 encodes Chimerin, a chimeric polypeptide encoded by an 

exon derived from the histatin 1 gene spliced downstream onto an exon derived from the linked 
statherin gene. Specifically, an exon encoding the N-terminal amino acids of both histatin 1 and 
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Chimerin (MKFFWALVLALMISMISADSHEKRHHGYRRKFM is spliced onto a 

statherin-derived exon that encodes the novel C-terminal amino acids of Chimerin 

(mTLLPLFEESSKSNANEKHYNLLYTLCFRILAFSIV^ but in contradistinction entirely 3'- 

untranslated nucleotide sequence in statherin mRNA. 
5 Chimerin is a low molecular weight, histidine-rich salivary polypeptide. Chimerin 

functions as part of the nonimmune host defence system in the oral cavity. 

Chimerin possesses broad spectrum antifungal activity, including that against the 

pathogenic yeast Candida alibcans, with minimal cytotoxicity towards normal host cells, 

suggesting its high potential as a novel anti-fungal therapeutic agent. Chimerin also possesses anti- 
10 bacterial activity, including that against Streptococcus mutans strains and the 

periodontopatheogenic Porphyromonas gingivalis. A great benefit of Chimerin is that to date no 

resistant fungal strains have been demonstrated and moreover, that Chimerin can be hydrolyzed in 

a natural way in the digestive tract. Therefore, Chimerin might be applied for long term use, 

intermitting the application of antibiotics. 
15 . A preferred embodiment of the invention is directed to compositions comprising the amino 

acid sequence of SEQ NO:6 (Chimerin) 

MKFFWALVLALM^ 

NLLYTLCFRILAFSIVT. 

Further included in the invention are fragments of the full-length Chimerin polypeptide 
20 having a biological activity described herein as well as the polynucleotides encoding these 

fragments. Preferred fragments with biological activity include the amino acid sequence 

comprising 

DSHEKRHHGYRRKFHEKHHSYHITLLPIJFE or 
DSHEKRHHGYRR or 

25 KFHEKHHSYHITLLPLFEESSK^ 

Further preferred is a method to use formulations comprising Chimerin in a 
physiologically compatible solution as further described in US Patent 4,725,576 ("Fungicidal 
polypeptide compositions containing L-histidine and methods for use therefore") and incorporated 
be reference in its entirety, including but not limited to the incorporation of Chimerin into a mouth 

30 wash. 

Further preferred is a method to use compositions comprising Chimerin as agents with 
which to treat a fungal or bacterial infection as further described in US Patent 5,912,230 ("Anti- 
fungal and anti-bacterial histatin-based peptides") and incorporated by reference in its entirety. 
The said method is comprised of contacting said fungi and bacteria with an effective amount of 
35 Chimerin polypeptide of the present invention. Said method for treating a fungal or bacterial 
infection of claim is applicable when said fungal or bacterial infection is selected from the group 
consisting of: (a) an infection of the oral cavity; (b) an infection of the vagina; (c) an infection of 
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the urethra; (d) an infection of the ear; (e) an infection of the skin; (f) a respiratory infection; (g) a 

mucosal infection; (h) an ophthalmic infection; and (i) systemic infection. 

Further preferred is a method to use compositions comprising Chimerin as described as 

agents with which to prevent recurring fungal or bacterial infection in patients including, but not 
5 limited to, those from the group consisting of: AIDS patients; diabetics; and xerostomia patients, 

including patients with Sjogren's syndrome and those patients whose salivary gland function has 

been compromised as a result of radiation therapy. 

Further preferred is method to use compositions comprising Chimerin for treating a fungal 

or bacterial infection wherein the fungus or bacterium is selected from the group consisting of: 
10 (a) Candida albicans; (b) Actinomyces actinomycetemcomitans; (c) Actinomyces viscosus; 

(d) Bacteroides forsythus; (e) Bacteriodes fragilis; (f) Bacteriodes gracilis; (g) Bacteriodes 

ureolyticus; (h) Campylobacter concisus; (i) Campylobacter rectus; (j) Campylobacter showae; 

(k) Campylobacter sputorum; (1) Capnocytophaga gingivalis; (m) Capnocytophaga ochracea; 

(n) Capnocytophaga sputigena; (o) Clostridium histolyticum; (p) Eikenella corrodens; 
15 (q) Eubacterium nodatum; (r) Fusobacterium nucleatum; (s) Fusobacterium periodonticum; 

(t) Peptostreptococcus micros; (u) Poiphyromonas endodontalis; (v) Porphyromonas gingivalis; 

(w) Prevotella intermedia; (x) Prevotella nigrescens; (y) Propionibacterium acnes; 

(z) Pseudomonas aeruginosa; (aa) Selenomonas noxia; (bb) Staphylococcus aureus; 

(cc) Streptococcus constellate; (dd) Streptococcus gordonii; (ee) Streptococcus intermedius; 
20 (ff) Streptococcus mutans; (gg) Streptococcus oralis; (hh) Streptococcus pneumonia; 

(ii) Streptococcus sanguis; (kk) Treponema denticola; (11) Treponema pectinovorum; 

(mm) Treponema socranskii; (nn) Veillonella parvula; and (oo) Wolinella succinogenes. 

The compositions and methods for treatment of fungal and bacterial infections discussed 

above are not limited to use in humans, but can have veterinary applications as well. 
25 In a further preferred embodiment, the present invention provides for an antibody that 

specifically binds Chimerin. The invention further relates to a method of screening for antibodies 

that specifically bind Chimerin comprising the steps of contacting the unique C-terminal 39 amino 

acids of Chimerin (YHTIIXPLFEESSKSNAISK^ with said test 

antibody and detecting or measuring whether said test antibody binds said Chimerin polypeptide. 
30 Further preferred is a method to use compositions comprising this antibody in diagnostic assays to 

measure Chimerin concentration in bodily fluids, including saliva. 

Further preferred is a method to use compositions comprising this antibody to specifically 

purify Chimerin from bodily fluids, including saliva, or from recombinant sources utilizing 

compositions comprising the nucleotide sequence of SEQ NO: 5 (Chimerin) or fragments thereof. 
35 Protein of SEQ ID NO:2 (Internal designation Clone 223583 JU4-044-2-0-E11-F) 

The cDNA of clone 223583 J 14-044-2-0-E1 1-F (SEQ ID NO:l) encodes the protein of 

SEQ ID NO:2 comprising the amino acid sequence: 
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MAACQLLLEITTFLRETFSCLPRPRTEPLVASTDHTKMPSQM 
GYLTKEDLRVLMEKEFPGFLENQK^ 

NDYFVVHMKQKGKK. Accordingly it will be appreciated that all characteristics and uses of 
polypeptides of SEQ ID NO:2 described throughout the present application also pertain to the 
5 polypeptides encoded by the nucleic acids included in Clone 223583_1 14-044-2-0-E 1 1 -F. In 
addition, it will be appreciated that all characteristics and uses of the polynucleotides of SEQ ID 
NO:l described throughout the present application also pertain to the nucleic acids included in 
Clone 223583_1 14-044-2-0-E1 1-F. A preferred embodiment of the invention is directed toward 
the compositions of SEQ ID NO: 1, SEQ ID NO:2, and Clone 2235 83_1 14-044-2-0-E1 1-F. Also 

10 preferred are polypeptide fragments having a biological activity as described herein and the 
polynucleotides encoding the fragments. 

The protein of SEQ ID NO:2 encodes S-100A 10 Related Protein (S-lOOAlOrP), which is a 
splice variant of S-100A. Specifically, the protein of SEQ ID NO:2 encodes the S-100A10 
polypeptide preceded by a unique sequence of 37 amino acids at the amino terminus comprising 

15 MAACQLLLErTTFLRETFSCLPRPRTEP LVASTDHTK. 

Dimeric S-100A10 can associate with dimeric annexin II to form a heterotetramer. As a 
component of this heterotetramer, S-100A10 can mediate a number of activities at the cell surface 
(Kassam G., et ah, Biochemistry 37:16958-66 (1998), Mai, J., et al., J. Biol. Chem. 275:12806-12 
(2000) which disclosures are hereby incorporated by reference in their entirety). S-lOOAlOrP 

20 antagonizes these activities. 

Heterotetrameric annexin II at the cell surface promotes the generation of plasmin, a serine 
protease with broad substrate specificity, through its association with both plasminogen and tissue 
plasminogen activator. The promotion of plasmin generation by annexin H plays a role in: 
(i) control of hemostasis and coagulation, (ii) macrophage migration and matrix 

25 remodeling,(iii) neuronal cell differentiation, (iv) tumor cell invasion and metastasis, and 
(v) cardiovascular development and angiogenesis. 

A preferred embodiment of the invention is directed to compositions comprising the amino 
acid sequence of SEQ NO:2 (S-lOOArP). Further preferred embodiment of the invention is 
directed to compositions comprising either monomeric or dimeric S-lOOAlOrP. Further included 

30 in the invention are fragments of the full-length S-lOOAlOrP polypeptide having a biological 
activity described herein as well as the polynucleotides encoding these fragments. 

Further preferred is a method to use compositions comprising S-lOOArP to suppress 
plasmin generation and thereby decrease inflammation at sites of chronic inflammation, preferably 
in the synovium of a joint capsule. Said methods comprise contacting S-lOOAlOrP compositions 

35 with the synovial fluid of the joint capsule. Preferred delivery of S-lOOAlOrP includes injection or 
transdermal contact at the site of the joint. 

Preferred is a method to use compositions comprising S-lOOArP to suppress tumor cell 
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metastasis. Further preferred is an embodiment of the method directed to the use of compositions 
of S-lOOAlOrP to suppress tumor cell metastasis facilitated by the binding of the cysteine protease 
cathepsin B to cell surface hetertetrameric annexin II. Said method is comprised of contacting said 
tumor cells with an effective dose of S-lOOAlOrP by injection. Further preferred is an 
5 embodiment of the method directed to the use of S-lOOAlOrP to suppress the metastasis of breast 
cancer. Further preferred in an embodiment of the method directed to the use of S-lOOAlOrP to 
suppress the metastasis of glioma. 

Preferred is a method to use compositions comprising S-lOOArP to suppress inflammation 
associated with wound healing. Further preferred are compositions comprised S-lOOArP used in 

10 methods of treatment comprised of contacting a wound or injured tissue with an ameliorative 
effective amount by injection or transdermal contact at the site of the wound. 

Acute promyelocyte leukemia (APL) is characterized by hyperfmbrinolysis due to 
heterotetrameric annexin II promoted plasmin generation and a consequential disseminated 
intravascular coagulation. In a preferred embodiment of the invention, S-lOOAlOrP is used to 

15 suppress this hyperfibrinolysis. Said method is comprised of contacting APL cells with an 
effective amount of S-lOOAlOrP by injection. 

A preferred embodiment of the invention is to use compositions comprising S-lOOAlOrP 
in a method to suppress angiogenesis associated with the growth of solid tumors. Further preferred 
is a method to use compositions comprising S-lOOAlOrP to suppress angiogenesis associated with 

20 breast cancer, prostate cancer, pancreatic adenocarcinoma, colorectal cancer, renal cell carcinoma, 
squamous cell carcinoma of the lung, and T cell lymphoma. Preferred delivery includes contacting 
the tumor with an effective amount of S-l OOAlOrP by intravenous injection. 

A preferred embodiment of the invention is to use compositions comprising S-lOOAlOrP 
in a method to suppress angiogenesis associated with chronic inflammation. Further preferred is a 

25 method to use compositions comprising S-lOOAlOrP to suppress angiogenesis associated with 
rheumatoid arthritis and thereby decrease inflammation, preferably in the synovium of a joint 
capsule. Said methods comprise contacting S-lOOAlOrP compositions with the synovial fluid of 
the joint capsule. 

In a further preferred embodiment, the present invention provides for an antibody that 
30 specifically binds an S-lOOAlOrP polypeptide of the present invention in a method of neutralizing 
S-lOOAlOrP function and thereby up-regulating the functional activity of extracellular 
heterotetrameric annexin DL Further preferred is a method to use compositions comprising this 
antibody to promote angiogenesis in ischemic heart tissue. Preferred delivery includes contacting 
the heart tissue with an effective amount of anti-S-lOOAlOrP antibody by intravenous injection. 
35 Further preferred is a method to use compositions comprising anti-S-1 0OA1 OrP antibody to 

promote neuritogenesis in ischemic brain tissue. Preferred delivery includes contacting the neural 
tissue with an effective amount of anti-S-lOOAlOrP antibody by local injection or transdermal 
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contact. 

Protein of SEQ ID NO:32 (Internal designation clone 477709JL 74-8-2-0-C10-F) 

The cDNA of Clone 477709_174-8-2-0~C10-F (SEQ ID NO:31) encodes the protein of SEQ ID 
NO:32 comprising the amino acid sequence : 
5 MAWRGWAQRGWGCGQAWGASVGGRSCEELTAVLTPPQLL^ 
KVEPRRSDPGTSGEAYKRSALIPPVEETVF^ 
ESLKSRVQSYFDGKADWLDSIRPQKEGDFR]^ 
LmVPSLQRTMmYFTSNPASKVLCSPMLLSTFSOT 
EQFMAVYLSAG\TSNFVSYVGKVAT^^ 

10 LGGALFGIWYVTYGHELIWKNREPLVKIWHEIRTNGPK^ Accordingly, it will be 

appreciated that all characteristics and uses of polypeptides of SEQ ID NO:32 described 
throughout the present application also pertain to the polypeptides encoded by the nucleic acids 
. included in Clone 477709J74-8-2-0-C10-F. In addition, it will be appreciated that all 

characteristics and uses of the polynucleotides of SEQ ID NO:3 1 described throughout the present 

15 application also pertain to the nucleic acids included in Clone 477709_174-8-2-0-C10-F. A 
preferred embodiment of the invention is directed toward the compositions of SEQ ID NO:3 1, 
SEQ ID NO:32, and Clone 477709_174-8-2-0-C10-F. Also preferred are polypeptide fragments 
having a biological activity as described herein and the polynucleotides encoding the fragments. 

The protein of SEQ ID NO:32 encodes Pretactilin, a splice variant of the protein of EMBL 

20 entry Q9H300. The corresponding locus located on chromosome 3 possesses at least 2 known 
variants described in entries AAH03653 and Q9H300 in EMBL. The closest known sequence, 
both at the nucleotide and amino acid levels, is Q9H300. Q9H300 is split into 10 exons, of which 
the protein of the invention is missing exon 8, while in AAH03653, it is exon 6 that is absent. 

Pretactilin is a polypeptide that interacts with the carboxyl-terminus of presenilin-1 and 

25 presenilin-2. Pretactilin harbours six putative transmembrane domains and belongs to the family 
of transmembrane rhomboid like proteins that have been isolated from various organisms, ranging 
from bacteria, plants, invertebrates to humans. The first isolated member of this family, the 
Drosophila melanogaster Rhomboid protein, is a seven transmembrane domain protein that has 
been implicated in Epidermal Growth Factor Receptor (EGFR) signaling, which as in mammals 

30 controls many aspects of growth and development. Genetic evidence indicates that Rhomboid 
controls the activation by proteolysis of the transmembrane EGFR ligand, Spitz, a TGFa-like 
molecule presents at the surface of neighbouring cells, to generate an active diffusible form of the 
ligand. 

The rhomboid domain of the Pretactilin extends from amino acid positions 186 to 323, and 
35 includes the predicted transmembrane domain region. It has been recently proposed by Pellegrini 
et al., 2001, J. Alzheimers Dis. 3 (2) which disclosure is hereby incorporated by reference in its 
entirety, that the members of the Rhomboid superfamily possess a metal-dependent protease 
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activity. 

The familial Alzheimer disease gene products, presenilin-1 and presenilin-2, are multipass 
membrane proteins consisting of 6-8 spanning regions that undergo endoproteolytic processing 
within their large hydrophilic loop at their carboxyl terminus. Immunolocalization studies have 
5 demonstrated that these ubiquitously expressed molecules, primarily located to the endoplasmic 
reticulum and the golgi apparatus, are also found on nuclear and plasma membranes. The 
presenilin proteins have been reported to be functionally involved in amyloid precursor protein 
processing, notch receptor signalling, and programmed cell death, or apoptosis. 

Alzheimer's Disease (AD) is a devastating neurodegenerative disorder characterized by 

1 0 progressive memory and cognition impairment associated with an increase secretion and 

deposition of a 4 kDa beta amyloid peptide (A beta) in extracellular senile plaques in the brain. In 
both healthy and AD patients, A beta is derived by proteolytic cleavage from the single 
transmembrane amyloid precursor protein (APP) by various proteinases that have been called APP 
secretases. Alpha secretases cleave APP within the amyloid sequences, whereas other proteases 

1 5 called beta- and gamma-secretases cleave on the N- and C-terminal ends, respectively. While a 
transmembrane aspartyl protease, BACE, has been identified as beta-secretase and several 
proteases may be alpha-secretases (ADAM-10, TACE, PC7), the nature of the gamma-secretase(s) 
remains elusive. Recently, a number of studies have suggested that the presenilins themselves, 
missense mutations in which cause the most aggressive forms of familial AD with increased 

20 production of A beta, could be the long sought gamma-secretases which release A-beta. 

The presenilins family of proteins has also been shown to interact with the Notch 
signalling pathway by forming stable complexes with Notch and being required for its proper 
cleavage at the cell surface. Notch is a single transmembrane domain cell surface receptor that 
. mediates many cell fate decisions during development in both vertebrates and invertebrates. Notch 

25 is synthesized as a large precursor that is cleaved in the trans-golgi network lumen to generate two 
fragments that form a heterodimeric receptor at the cell surface. Following ligand receptor 
binding, the C-terminal transmembrane-intracellular fragment of Notch is cleaved within its 
transmembrane domain by an as yet unidentified protease. This ligand-activated cleavage releases 
the Notch intracellular domain from the membrane, allowing it to translocate to the nucleus where 

30 it affects the transcriptional activity of target genes through interactions with proteins that include 
members of the CSL family. 

In addition to their roles in APP processing and Notch receptor signaling, extensive 
evidence suggests that presenilins are also involved in programmed cell death. Over-expression of 
Presenilin-2 increases apoptosis induced by a number of apoptotic stimuli, whereas mutations in 

35 the presenilin genes as found in Familial Azheimer's Disease cases generate molecules with 
constitutive pro-apoptotic activity. Complementary studies have demonstrated that depletion of 
PS2 protein levels by antisense RNA protects cells against apoptosis induced by a number of cell- 
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death-inducing apoptotic stimuli. At the molecular level, it has been observed recently that the 
carboxyl-termini of presenilin-1 and presenilin-2 interact with Bcl-XL protein, an anti-apoptotic 
member of the Bcl-2 family, providing an additional link between these proteins and the apoptotic 
pathway. 

5 By virtue of its being either a transmembrane protease or a transmembrane protease 

cofactor, Pretactilin interacts physically with presenilins to form active complexes in the 
membranes that are involved in APP metabolism, Notch signalling and programmed cell death via 
specific protein processing. Specifically, Pretactilin contributes to the proteolytic processing of a 
number of protein substrates including APP and Notch. 

10 In one embodiment of the present invention, Pretactilin can be used in a protease cocktail 

in order to digest proteins, preferentially transmembrane proteins, from a biological sample. Use 
of a protease cocktail could be of particular interest either to quickly purify DNA from crude 
cellular extracts or to remove transmembrane and membrane-associated proteins in isolated 
membranes preparation in order to prepare protein-free membranes vesicles useful for protein 

15 reconstitution and functional assays in vitro. In a preferred embodiment, Pretactilin is added to a 
protease cocktail in combination with one or more presenilin proteins. 

In another embodiment, Pretactilin can be used as a transmembrane marker that would be 
useful during protein purification methods for monitoring the recovery of transmembrane proteins 
from a biological sample or from cells grown in vitro. In such methods, the proteins can be 

20 detected in any of a number of ways. For example, Pretactilin can be labeled and added to the 
sample or the cells prior to the purification step. Alternatively, Pretactilin can be recombinantly 
fused to a detectable protein such as GFP and expressed in the organism from which the sample 
will be taken, or in the cells, prior to purification. In addition, Pretactilin can be detected 
throughout the purification steps using a monoclonal or polyclonal antibody that specifically 

25 recognizes Pretactilin. 

The present invention also provides new methods to purify wild type and mutant presenilin 
proteins, preferentially human presenilins, consisting in using Pretactilin or fragments thereof to 
co-immunopurify presenilins from cellular extracts. Methods to co-immunopurify proteins are 
well known to those skilled in the art. For example, presenilins can be co-immunopurified by 

30 affinity column chromatography or by immobilisation on sepharose-beads with monoclonal or a 
polyclonal antibody that specifically binds Pretactilin. Such purified wild type and mutant 
presenilins would then be of particular interest to generate presenilin antibodies that could be used 
for the treatment of Alzheimer's disease. In addition, the purified presenilin polypeptides could 
subsequently be used for the diagnosis of Alzheimer's disease as described below. 

35 In a further embodiment, the present invention is used in a diagnostic method for detecting 

Alzheimer's disease in an individual comprising the steps of: 

(a) co-immunopurifying presenilins with Pretactilin from a biological sample, 
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(b) adding the corresponding purified polypeptides to membranes vesicles containing a 
reconstituted presenilins substrate, preferentially the Notch protein, as well as, 
optionally, a reconstituted Pretactilin, 

(c) quantifying protease activity of these membrane vesicles compared to reconstituted 

5 positive and negative controls (e.g., identical membrane vesicles where wild type and 

mutant presenilins have been incorporated, respectively), by proteolytic fragment 
detection and quantification. 
In another embodiment, the present invention provides new methods to identify other 
proteins that interact physically with presenilins and/or Pretactilin. In a preferred method, 
10 Pretactilin is used to co-immunopurify presenilin complexes from cellular extracts, preferentially 
from brain cellular extracts, then disrupting the isolated complexes in order to release its 
components and identifying the associated proteins, for example by microsequencing followed by 
gene cloning and characterisation. Alternatively, Pretactilin can be used as bait in two-hybrid 
experiments in yeast for the screening of interacting polypeptides. Because such interacting 
1 5 proteins would likely be also involved in the modulation of A beta peptide production, their 

characterisation would certainly lead to the identification of new genes whose mutations cause or 
predispose to Alzheimer's disease. They would also provide useful novel targets for gene and drug 
therapies of the disease. 

In a further embodiment, Pretactilin can be used in a method to locate presenilins in 
20 subcellular compartments of a cell, preferentially neuronal cells, comprising the steps of contacting 
an isolated sample of cells with labeled Pretactilin and detecting the labeling in those cells. 
Methods used for labeling proteins are well known in the art, any of which can be used in the 
present invention. 

Pretactilin also provides a method to restore normal APP processing in mutant cells 
25 producing increased level of A beta peptide by reducing the level or the activity of the present 
protein in the cells. This can be achieved using techniques well known in the art, for example 
using antibodies, antisense molecules, ribozymes, or administrating to said mutant cells small 
molecule inhibitors of Pretactilin. 

The present invention also provides an in vitro system useful to screen for inhibitors of A 
30 beta production that could be of particular interest either for the prevention or the treatment of 
Alzheimer's disease, consisting in transfecting cultured cells in vitro, preferentially brain cells, 
more preferentially neuronal cells, with a nucleotide sequence encoding Pretactilin placed under 
the control of a strong constitutive promoter sequence in order to achieve high expression level of 
Pretactilin in those cells, applying to the cells the substance to be tested, measuring the amount of 
35 A beta peptide produced by these cells compared to control transfected cells. 

In another embodiment, Pretactilin can be used to modulate apoptosis of cells. For 
example, the level of Pretactilin can be increased in cells, preferentially in tumor cells, in vitro or 
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in vivo, thereby inducing apoptosis. The level or the activity of Pretactilin can be increased in any 
of a number of ways, including by administering purified Pretactilin to the cells, transfecting the 
cells with a polynucleotide encoding Pretactilin, or administering a compound to the cells that 
causes an increase in the activity or expression of Pretactilin. Alternatively, apoptosis can be 
5 inhibited by decreasing the level or the activity of Pretactilin in cells, for example using antibodies, 
antisense molecules, ribozymes, or small molecule inhibitors of Pretactilin. In a preferred 
embodiment, Pretactilin is used to inhibit apoptosis of neuronal cells in patients suffering of 
neurodegenerative diseases, preferentially, Alzheimer's disease. 

In another embodiment, the present invention provides a transgenic non-human animal, 

1 0 preferentially a mammal, more preferentially a rodent, producing high level of A beta peptide due 
to overproduction of Pretactilin. Such trangenic animal would provide a useful in vivo model to 
study the onset of Alzheimer's disease and more particularly to investigate the role of A beta 
peptide deposits in the etiology of the disease. It would also be of considerable interest for the 
screening of compounds that inhibit A beta peptide secretion or accumulation. Such transgenic 

1 5 animal can be obtained by any of the current methods used to generate transgenic animals that are 
well known for those skilled in the art, for example in the mouse, using DNA microinjection into 
fertilized eggs or transfection of embryonic stem cells. High over-expression of Pretactilin can be 
achieved by placing the nucleotide sequence encoding Pretactilin under the control of a strong 
promoter sequence. The promoter sequence can be derived from a gene having a broad expression 

20 in the animal or from a gene whose expression is restricted to the brain. Preferentially, a 
regulatable promoter sequence is used in order to control temporally the expression of the 
transgene once introduced into the animal. 

In another embodiment, the level or the activity of Pretactilin can be modulated to provide a 
treatment for Alzheimer's disease in a patient. Indeed as A beta peptide deposition is an early and 

25 invariant event in Alzheimer's disease, it is believed that a treatment that affects A beta production 
will be useful in the treatment of the disease. Accordingly, reducing level or activity of Pretactilin 
in mutant cells would thereby diminish A beta production. This could be achieved by any of the 
well known strategies used for therapy in vivo, for example using antisens molecules, antibody or 
small molecule inhibitors of Pretactilin. 

30 Protein of SEQ ID NO 34: (Internal designation 145606 _106-023-2-0-B3-F): 

The cDNA of clone (SEQ ID NO:33) encodes the human MS4A5 protein, comprising the 
sequence: 

MDSSTAHSPWLVFPPEITASEYESTELSATTFSTQSP 
GVIFLFTLLKPYPR^ 
35 AGIILLTFGFILDQNYICGYS 

CEQCC (SEQ ID NO:34). Accordingly, it will be appreciated that all characteristics and uses of 
polypeptides of SEQ ID NO:34 described throughout the present application also pertain to the 
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polypeptides encoded by the nucleic acids included in clone 145606_106-023-2-0-B3-F. In 
addition, it will be appreciated that all characteristics and uses of the polynucleotides of SEQ ID 
NO:33 described throughout the present application also pertain to the nucleic acids included in 
clone 145606_106-023-2-0-B3-F. A preferred embodiment of the invention is directed toward the 
5 compositions of SEQ ID NO:33, SEQ ID NO:34, and Clone 145606J06-023-2-0-B3-F. Also 
preferred are polypeptide fragments having a biological activity as described herein and the 
polynucleotides encoding the fragments. 

The cDNA of SEQ ID NO:33 comprising 5 exons encodes the 200 amino-acid MS4A5 
protein (STR Q9H3 V2), which belongs to the MS4A protein family (membrane-spanning four- 

10 domains, subfamily A). Four members of MS4A family in human (MS4A4-7) and in mouse 
(MS4A8-1 1) have been described (Ishibashi K. et al, Gene (2001), 264, 87-93 which disclosure is 
hereby incorporated by reference in its entirety). As with the other members of the 
CD20/Fc(sigma)RI(beta)/HTm4 superfamily, all MS4A proteins are highly hydrophobic with four 
transmembrane domains (but are distinct from tetraspanin family members which also have four 

1 5 transmembrane domains). The cDNA of SEQ ID NO:33 encoding the protein of SEQ ID NO:34 
possesses a conserved sequence around the initiating methionine (ATC ATG G) and a consensus 
protein kinase A (PKA) phosphorylation site (KRKTT) at the intracellular loop between the second 
and third transmembrane domains. In contrast with other members of MS4A family, which are 
mostly expressed in lymphoid tissues, MS4A5 is expressed in testis, pancreas, and at low levels in 

20 the heart and brain. The gene of MS4A5 is located on human chromosome 1 1, specifically at 
position llq!2, the same chromosome as the CD20, Fc(sigma)RI(beta) and HTm4 genes. MS4A5 
is a novel transmembrane protein that acts alone or in combination with other proteins as an ion 
channel, e.g. a ligand-gated calcium channel. MS4A5 is involved in a number of cellular functions 
in non-lymphoid cells, for example intracellular signaling, regulating intracellular calcium 

25 concentrations, exocrine functions, and endocrine functions. 

In one embodiment, the protein of the invention or fragment thereof provides a method to 
detect cells specifically expressing the present protein, using for example flow cytometry 
technology or classical in situ detection techniques which are well known in the art. Such methods 
are useful, e.g. to specifically detect cells of the testis, pancreas, heart, or brain, as the present 

30 protein is highly expressed in these cell types. Such methods are also useful to detect cells over- or 
under-expressing the present protein, and is thus useful for diagnosing diseases or conditions 
resulting from or associated with an increase or decrease in expression or activity of the protein. 
This method includes the steps of contacting a biological sample obtained from an individual 
suspected of suffering from the disease or condition, or at risk of developing the disease or 

35 condition, with a compound capable of selectively binding the present protein or nucleic acids, e.g. 
an antibody directed against the present protein or a polynucleotide probe directed against the 
present cDNA. Following this binding step, the method further comprises detecting the presence 
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or absence of selective binding between the compound and the cells or proteins within the sample. 
In preferred embodiments, the compound is labeled, and the sample comprises cells derived from 
the testis, pancreas, heart, or brain. 

In another embodiment, the protein of the invention or fragment thereof can be used to 

5 modulate die proliferation of cells. For example, the level or activity of the present protein can be 
increased in cells to increase the rate or extent of proliferation of the cells. In one such 
embodiment, the proliferation of cells in a biological sample is increased by contacting the 
biological sample with an amount of the present protein sufficient to increase the rate or extent of 
proliferation of one or more cells within the sample, or with a compound that increases the activity . 

10 or expression of the present protein within one or more cells of the sample. Such methods can be 
performed either in vitro or in vivo and, preferably, the cells comprise pancreatic, testicular, heart 
or brain cells. The level of the present protein can be increased in cells in any of a number of 
ways, including by administering purified protein to the cells, transfecting the cells with a 
polynucleotide encoding the protein, or administering a compound to the cells that causes an 

1 5 increase in the activity or expression of the protein. Alternatively, proliferation of cells can be 
inhibited by decreasing the level of the present protein in cells, for example using antisense 
molecules, or more specifically inhibit the activity of the present protein using direct or indirect 
inhibitor molecules or antagonistic antibodies directed against the present protein. 

In a further embodiment, the protein of the invention or fragment thereof can be used to 

20 modulate cellular calcium concentration and thereby modulate calcium-dependant signaling. 
Calcium transport can be modulated, for example, by contacting a biological sample with an 
amount of the present protein sufficient to increase calcium transport of one or more cells within 
the sample, or with a compound that increases the activity or expression of the present protein 
within one or more cells of the sample. Such methods can be used either in vitro or in vivo and 

25 preferably, but not limited to, the methods are performed on cells comprising pancreatic, testicular, 
heart or brain cells. The level of the present protein can be increased in cells in any of a number of 
ways, including by administering purified protein to the cells, transfecting the cells with a 
polynucleotide encoding the protein, or administering a compound to the cells that causes an 
increase in the activity or expression of the protein. Alternatively, the activity of the present 

30 protein can be inhibited by decreasing the level of the present protein in cells, for example using 
antisense molecules, by using direct or indirect inhibitor molecules or antagonistic antibodies of 
the present protein, or by expressing in the cells an inactive form of the protein that acts in a 
dominant negative fashion to inhibit the normal calcium signalling in the cells carried out by other 
members of the MS4A family. 

35 The present invention also provides animal models generated by modulating the expression 

or activity of the present protein in one or more tissues of the animal. Such animals represent an in 
vivo assay method for testing candidate molecules potentially useful for the treatment of various 
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pathophysiological aspects of diseases associated with abnormal calcium homeostasis and/or cell 
growth or any function specifically related to the activity of the present protein. These animals can 
be generated with any method of increasing or decreasing the expression of the present protein. 

hi another embodiment, since calcium is an universal intracellular messenger, controlling a 
5 diverse range of cellular processes such as gene transcription, cell proliferation, and more 
specifically muscle contraction, synaptic function, secretion of insulin in pancreatic islets of 
Langerhans, and many others, the present protein or fragment thereof provides a method of treating 
different pathological states arising from or associated with destabilization of calcium homeostasis 
in many organs (brain, kidney, parathyroid gland, pancreas, bone, intestine). In addition, any of 

10 these processes can be enhanced or inhibited in cells or in patients, even when the protein is at 
normal levels in the cells or in the cells of the patient, by causing a decrease or increase in the 
normal level of the protein in the cells. For any of the herein-described methods, the activity of the 
present protein can be increased or inhibited in any of a large number of ways, for example by 
using polyclonal or monoclonal antibodies, or any other compound having qualitative biological 

15 activity in common with a full-length antibody, that specifically binds to the present protein and 
exerts stimulatory or inhibitory effects on functions involving the present protein. 

Any compound interacting with the present protein and thereby promoting or interfering 
with its activities can also be used as a method of treating any of the pathologies described above. 
Such compounds can be identified, e.g., using interaction-screening approaches such as, but not 

.20 limited to, co-immunoprecipitation, two-hybrid methods. Further, compounds can be screened for 
the ability to modulate the activity of the present protein by providing a cell expressing the present 
protein, or providing lipid bilayers reconstituted with the present protein, and detecting the ability 
of a compound to modulate the activity of the present protein in the cell or in the bilayer. Such 
activity can be detected in any of a large number of ways, including but not limited to detecting 

25 calcium flux or calcium signalling in the cells or membranes, e.g. as manifest in the activity of 
downstream members of the signal transduction pathway. The present invention also provides an 
in vitro method to identify any compound able to promote or interfere with some or all activities of 
the present protein, the method comprising the steps of contacting the present protein with a test 
compound and detecting the ability of the compound to bind to or modulate the activity of the 

30 protein. Also in this embodiment, the present protein or any effective compound identified by this 
way of investigation useful for the treatment of disorders described above can be used in 
combination with other drugs or compounds. 

As it has been shown that multiple loci on chromosome 1 lql3 are relevant to atopic asthma (Adra 
CN. et al 9 Clin. Genet. (1999) June; 55(6):43 1-437), the present invention also provides a novel 
35 candidate gene for this condition. Accordingly, the present invention provides methods for the 
diagnosis of atopic asthma, the method comprising determining the identity of one or more 
nucleotides of the present nucleic acids in one or more cells of an individual suspected of having 
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the condition, or at risk of developing the condition, and determining if the cell or cell contains a 
nucleotide within the present nucleic acid sequence indicative of the condition, or of an elevated 
risk of developing the condition. The identity of such nucleotides can be determined in any of a 
number of ways, for example using any standard sequencing or genotyping method, many of 
5 which are well known in the art. 

Protein ofSEQ ID NO:36 (Internal designation Clone 1000769575_208-22-l-0-B2-F) 

The cDNA of Clone 1000769575_208-22-l-0-B2-F (SEQ ID NO:35) encodes the protein 
of SEQ ID NO:36 comprising the amino acid sequence 
MGMSSmLKYVIJFFFM.IJFWI 

10 MWAFLGCMGSIKENKCLLM 

WDSIQSFLQCCGINGTSDWTSGPPASCPSDRKVEGCYAKARLWraSNFFIRGPY. 
Accordingly it will be appreciated that all characteristics and uses of polypeptides of SEQ ID 
NO:36 described throughout the present application also pertain to the polypeptides encoded by the 
nucleic acids included in Clone 1000769575_208-22-l-0-B2-F. In addition, it will be appreciated 

1 5 that all characteristics and uses of the polynucleotides of SEQ ID NO:3 5 described throughout the 
present application also pertain to the nucleic acids included in Clone 1000769575_208-22-l-0- 
B2-F. A preferred embodiment of the invention is directed toward the compositions of SEQ ID 
NO:35, SEQ ID NO:36, and Clone 1000769575_208-22-l-0-B2-F. Also preferred are polypeptide 
fragments having a biological activity as described herein and the polynucleotides encoding the 

20 fragments. 

The protein of SEQ ID NO:36 encodes Antaginin, a complex splice variant of CD53 with 
novel function. In Antaginin, splicing of exon 4 onto exon 5 results in a deletion of 9 amino acids 
(4 from exon 4, 5 from exon 5) and a correspondingly unique junctional sequence. In addition, 
splicing of exon 7 onto normally 3'-untranslated nucleotide sequence within exon 8 results in a 
25 deletion of 14 amino acids from exon 7, as well as the deletion of the carboxy-terminal 23 amino 
acids of CD53 and its replacement with a unique carboxy-terminal sequence of 6 amino acids in 
Antaginin. 

CD53, restricted in expression to leukocytes, is a member of the tetraspaninin superfamily. 
CD53 is an integral membrane protein characterized by four transmembrane domains (TM1-TM4), 

30 forming a small and a large extracellular loop (EC1 and EC2, respectively), with short intracellular 
amino and carboxyl tails. EC1 and EC2 of CD53 comprise the amino acid sequences 37-54 and 
107-181, respectively (numbered from the initiating methionine of CD53). TM1-TM4 of CD53 
comprise the amino acid sequences 11-36, 55-69, 81-106, and 182-206, respectively (numbered 
from the initiating methionine of CD53) (Rost, B. et al., Prot Sci. 5:1704-18, (1996) which 

35 disclosure is hereby incorporated by reference in its entirety). 

CD53 facilitates the assembly of modular signalling complexes at the cell surface. 
Specifically, CD53 acts as an adaptor to functionally link an extracellular ligand-binding domain 
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(such as that of beta 1 integrin) to an intracellular domain involved in signal transduction (such as 
that of protein kinase C) (Zhang, XA et al. s J. Biol. Chem. (2001) which disclosure is hereby 
incorporated by reference in its entirety). Beta 1 integrin has been shown to associate with CD53 
through EC2. Moreover, through its interaction with other tetraspaninins, CD53 is incorporated 
5 into a higher order tetraspaninin web exisiting at the cell surface. CD53 displays numerous 

properties that indicate its physiological importance in cell adhesion, motility, activation (including 
the delivery of a co-stimulatory signal to for CD3/T cell receptor-mediated T cell activation), and 
proliferation (Boucheix, C. et al. Expert Reviews in Molecular Medicine (2001) which disclosure 
is hereby incorporated by reference in its entirety). 

10 Antaginin is characterized by a highly perturbed EC2 loop and a highly divergent TM4 

transmembrane domain. The EC2/TM4 region of Antaginin comprises amino acids 107-179 
(numbered from the initiating methionine of Antaginin). In addition, Antaginin is characterized by 
an extracelluar perturbation of the amino acid sequence at the junction of exons 4 and 5 (amino 
acids 124/125, numbered from the initiating methionine of Antaginin) (Rost, B. et al., Prot. Sci. 

15 5:1704-18, (1996) which disclosure is hereby incorporated by reference in its entirety). Antaginin 
antagonizes CD53-facilitated assembly of functional modular signalling complexes at the cell 
surface. 

In a preferred embodiment, the present invention provides for an antibody that specifically 
binds Anataginin of the present invention. Further preferred is a method for making such antibody 

20 wherein a mouse is immunized with a syngeneic cell line transfected with Antaginin. Monoclonal 
antibodies derived from said mouse are screened for binding to the Antaginin-transfected cell line 
but not to the identical cell line transfected with human CD53. Antibody specificity is further 
established through amino acid sequence analysis of immunoprecipitated material. Further 
preferred is a method for making said antibody wherein said antibody binds to EC1 or the 
. 25 sequence carboxyl-terminal (EC2/TM-4 region) of Antaginin. EC 1 and the EC2/TM4 region of 
Antaginin comprise the amino acid sequences 37-54 and 107-179, respectively (numbered from the 
initiating methionine of Antaginin). Further preferred is a method for making said antibody 
wherein said antibody binds to the EC2/TM4 region of Antaginin. Methods of generating said 
monoclonal antibody and of establishing its specificity are well known to those skilled in the art. 

30 In a preferred embodiment, the present invention provides for a method of contacting said 

antibody and specifically binding it with Antaginin. Further preferred is a method for using said 
antibody diagnostically to determine the basis for an impaired immune response. Further preferred 
is a method of using said antibody diagnostically in a flow cytometric analysis of Antaginin 
expression by leukocytes in a pathological context. Further preferred is a method of using said 

35 antibody diagnostically in a flow cytometric analysis of Antaginin expression by leukocytes in the 
context of viral infection wherein the virus is selected from, but not restricted to, the group 
consisting of: (a) Cytomegalovirus; (b) Human immunodeficiency virus; (c) Human herpes virus 6 
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(HHV 6); (d) Hepatitis C virus; and (e) Hepatitis D virus. 

Further preferred is a method of using said antibody diagnostically in a flow cytometric 

analysis of Antaginin expression by normal leukocytes in the leukemic patient to determine the 

basis for an impaired anti-tumor immune response wherein the leukemia is selected from, but not 
5 restricted to, the group consisting of: (a) B cell acute lymphoblastic leukemia (B-ALL); 

(b) Chronic lymphocytic leukemia (CLL); (c) T cell acute lymphoblastic leukemia (T-ALL); 

(d) Multiple myeloma; and (e) Acute myeloid leukemia (AML). 

Further preferred is a method of using said antibody diagnostically in a flow cytometric 

analysis of Antaginin expression by normal leukocytes in the cancer patient to determine the basis 
10 for an impaired anti-tumor immune response wherein the cancer is selected from, but not restricted 

to, the group consisting of: (a) Melanoma; (b) Breast carcinoma; (c) Lung carcinoma; (d) Colon 

carcinoma; (e) Hodgkin's lymphoma; (f) Non-Hodgkin's lymphoma; (g) Prostatic carcinoma; 

(h) Pancreatic carcinoma; (i) Uterine carcinoma; (j) Ovarian carcinoma; (k) Testicular carcinoma; 

(1) Renal carcinoma; (m) Hepatic carcinoma; and (n) Lung non-small-cell carcinoma. 
1 5 The threshold for leukocyte activation can be regulated by cytokine. In a further 

embodiment, the present invention provides for the use of said Antaginin antibody in in vitro 

analysis of cytokine regulation of Antaginin expression by leukocytes. Further preferred is a 

method of using said antibody in a flow cytometric analysis of said regulation by cytokine wherein 

the cytokine is selected from, but not restricted to, the group consisting of: (a) Interferon gamma; 
20 (b) Interleukin 17; (c) Interleukin 4; (d) Interleukin 10; (e) Interleukin 13; (f) Interleukin 15; 

(g) Interleukin 1; (h) Interleukin 6; (i) Monocyte chemotactic protein 1 (MCP-1); (j) Interleukin 8; 

and (k) Tumor necrosis factor alpha. 

Further preferred is a method of contacting said antibody with Antaginin and thereby 

sterically inhibiting the capacity of Antaginin to antagonize the CD53-facilitated assembly of 
25 functional modular signalling complexes at the cell surface. In so doing, said Antaginin antibody 

up-regulates CD53-mediated leukocyte activation. Preferred compositions comprise the Antaginin 

antibody or fragments or derivatives thereof. Preferred route of administration is intravenous 

injection. 

hi a further embodiment of the invention, said Antaginin antibody is incorporated as an 
30 adjuvant in vaccine preparations in a method to up-regulate the elicited immune response. In said 
method, said Antaginin antibody facilitates the CD53-mediated leukocyte activation contributing 
to establishment of specific immunity. Said Antaginin antibody up-regulates CD53-mediated 
leukocyte activation by sterically inhibiting the capacity of Antaginin to antagonize the CD53- 
facilitated assembly of functional modular signaling complexes at the cell surface. Further 
35 preferred is a method to use said antibody in a vaccine targeting a viral infection wherein the virus 
is selected from, but not restricted to, the group consisting of: (a) Human immunodeficiency virus; 
(b) Human herpes virus 6 (HHV 6); (c) Hepatitis C virus; (d) Hepatitis D virus; (e) Hepatitis E 
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virus; (f) Cytomegalovirus; (g) Respiratory syncytial virus; (h) Herpes simplex virus type I; 
(i) Herpes simplex virus type II; (j) Influent virus; (k) Parvovirus; (1) Coxsachie virus; 
(m) Echovirus; (n) Epstein-Barr virus; (o) Dengue virus; (p) Lassa fever virus; and (q) Ebola virus. 
Further preferred is a method to use said Antaginin antibody in a vaccine targeting a 
5 protozoan infection wherein the protozoa is selected from, but not restricted to, the group 
consisting of: (a) Entamoeba histolytica; (b) Cryptosporidium parvum; (c) Plasmodium 
falciparum; (d) Trypanosoma; (e) Leishmania; (f) Trichomonas vaginalis; and (g) Acanthamoeba. 

Viruses can suppress the immune response as a means of evading immune surveillance. In 
a further embodiment of the invention, said Antaginin antibody is used in a method of up- 

10 regulating the immune response against an ongoing viral infection. In said method, said Antaginin 
antibody facilitates the CD53-mediated leukocyte activation contributing to the anti-viral immune 
response. Said Antaginin antibody up-regulates CD53-mediated leukocyte activation by sterically 
inhibiting the capacity of Antaginin to antagonize the CD53-facilitated assembly of functional 
modular signaling complexes at the cell surface. Further preferred is a method of up-regulating the 

1 5 immune response against an ongoing viral infection wherein the virus is selected from, but not 
restricted to, the group consisting of: (a) Human immunodeficiency virus; (b) Human herpes virus 
6 (HHV 6); (c) Hepatitis B virus; (d) Hepatitis C virus; (e) Hepatitis D virus; (f) Cytomegalovirus; 
(g) Respiratory syncytial virus; (h) Influenza virus; (i) Herpes simplex virus type I; (j) Herpes 
simlex virus type II; (k) Epstein Barr virus; (1) Varicella zoster virus; (m) Morbillivirus; 

20 (n) Parmyxovirus; (o) Papilloma virus; (p) Adenovirus; (q) Dengue virus; (r) Lassa fever virus; 
(s) Coxsachie virus; (t) Echovirus; and (u) Ebola virus. 

Bacteria can suppress the immune response as a means of evading immune surveillance. 
In a further embodiment of the invention, said Antaginin antibody is used in a method of up- 
regulating the immune response against an ongoing bacterial infection. In said method, said 

25 Antaginin antibody facilitates the CD53-mediated leukocyte activation contributing to the anti- 
bacterial immune response. Said Antaginin antibody up-regulates CD53-mediated leukocyte 
activation by sterically inhibiting the capacity of Antaginin to antagonize the CD53-facihtated 
assembly of functional modular signaling complexes at the cell surface. Further preferred is a 
method of up-regulating the immune response against an ongoing bacterial infection wherein the 

30 bacteria is selected from, but not restricted to, the group consisting of: (a) Mycobacterium avium 
complex; (b) Pneumocystis carinii; (c) Acne vulgaris; (d) Legionella pneumophilia; (e) Yersinia 
pestis; (f) Ureaplasma urealyticum; (g) Chlamydia pneumoniae; (h) Helicobacter pylori; 
(i) Treponema pallidum; Q) Neisseria gonorrhoeae; (k) Salmonella typhimurium; (1) Vibrio 
cholera; (m) Clostridium difficile; (n) Bacillary dysentary; (o) Pencillin resistant Pneumococcus; 

35 (p) Burkholderia mallei; (q) Mycobacterium leprae; (r) Mycobacterium haemophilum; 
(s) Mycobacterium kansasii; (t) Haemophilus influenzae; and (u) Bacillus anthracis. 

Protozoa can suppress the immune response as a means of evading immune surveillance. 
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In a further embodiment of the invention, said Antaginin antibody is used in a method of up- 
regulating the immune response against an ongoing protozoan infection. In said method, said 
Antaginin antibody facilitates the CD53-mediated leukocyte activation contributing to the anti- 
protozoan immune response. Said Antaginin antibody up-regulates CD53-mediated leukocyte 
5 activation by sterically inhibiting the capacity of Antaginin to antagonize the CD53-facilitated 
assembly of functional modular signaling complexes at the cell surface. Further preferred is a 
method of up-regulating the immune response against an ongoing protozoan infection wherein the 
protozoa is selected from, but not restricted to, the group consisting of: (a) Entamoeba histolytica; 
(b) Cryptosporidium parvum (c) Giardia lamblia; (d) Toxoplasma gondii; (e) Isospora belli; 

10 (f) Encephalitozoon cuniculi; (g) Enterocytozoon bieneusi; (h) Plasmodium falciparum; 
(i) Trypanosoma; (j) Leishmania; (k) Trichomonas vaginalis; and (1) Acanthamoeba. 

In a further embodiment of the invention, said Antaginin antibody is used in a method of 
up-regulating the immune response against an ongoing fungal infection wherein the fungus is 
selected from, but not restricted to, the group consisting of: (a) Cryptococcal meningitis; 

15 (b) Histoplasma capstulatum; (c) Coccidiodes immitis; and (d) Candida albicans. 

Tumors can suppress the immune response as a means of evading immune surveillance. In 
a further embodiment of the invention, said Antaginin antibody is used in a method of up- 
regulating the immune response against a tumor. In said method, said Antaginin antibody 
facilitates the CD53-mediated leukocyte activation contributing to the anti-tumor immune 

20 response. Said Antaginin antibody up-regulates CD53-mediated leukocyte activation by sterically 
inhibiting the capacity of Antaginin to antagonize the CD53-facilitated assembly of functional 
modular signaling complexes at the cell surface. Further preferred is a method of up-regulating the 
immune response against a tumor wherein the tumor is selected from, but not restricted to, the 
group consisting of: (a) Melanoma; (b) Breast carcinoma; (c) Lung carcinoma; (d) Colon 

25 carcinoma; (e) Hodgkin's lymphoma; (f) Non-Hodgkin's lymphoma; (g) Prostatic carcinoma; 
(h) Pancreatic carcinoma; (i) Uterine carcinoma; (j) Ovarian carcinoma; (k) Testicular carcinoma; 
(1) Renal carcinoma; (m) Hepatic carcinoma; and (n) Lung non-small-cell carcinoma. 

In a further embodiment of the invention, said Antaginin antibody is incorporated as an 
adjuvant in therapeutic anti-tumor vaccines wherein the tumor is selected from, but not restricted 

30 to, the group consisting of: (a) Melanoma; (b) Breast carcinoma; (c) Lung carcinoma; (d) Colon 
carcinoma; (e) Hodgkin's lymphoma; (f) Non-Hodgkin's lymphoma; (g) Prostatic carcinoma; 
(h) Pancreatic carcinoma; (i) Uterine carcinoma; (j) Ovarian carcinoma; (k) Testicular carcinoma; 
(1) Renal carcinoma; (m) Hepatic carcinoma; and (n) Lung non-small-cell carcinoma. 

Intracellular (macrophage) pathogens can be eliminated either through macrophage 

35 activation or through lysis of infected macrophages by cytolytic T lymphocytes (Chun et al., J. 
Exp. Med. 193:1213 (2001) which disclosure is hereby incorporated by reference in its entirety). 
In a further embodiment of the invention, said Antaginin antibody is used in a method to eliminate 
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intracellular pathogens by facilitating macrophage activation or cytolytic T lymphocyte generation 
wherein the pathogen is selected from, but not restricted to, the group of intracellular (macrophage) 
pathogens consisting of: (a) Histoplasma capsulatum; (b) Mycobacterium tuberculosis; 

(c) Salmonella typhimurium; (d) Chlamydia trachomatis; and (e) Pneumocystis carinii. 

5 There have been several examples of tetraspanins playing a role in the viral life cycle. 

Anti-tetraspanin antibodies inhibit syncytium formation and/or virus production. This was 
observed for the tetraspanins CD81 and CD82 with human T-lymphotropic virus 1, and for the 
tetraspanin CD9 with the feline immunodeficiency virus and the canine distemper virus. It is also 
. believed that the tetraspanin CD81 also plays a role in the aetiopathogenesis of hepatitis C virus 

10 (Boucheix, C. et al. (2001) which disclosure is hereby incorporated by reference in its entirety). In 
a further embodiment of the invention, said Antaginin antibody is used in a method of blocking 
viral infection when Antaginin is used as a virus receptor. Further preferred is the use of said 
Antaginin antibody in a method of blocking said viral infection when Antaginin used as said virus 
receptor and is expressed by a leukocyte type selected from, but not restricted to, the group of 

15 leukocyte types consisting of: (a) T lymphocyte; (b) B lymphocyte; (c) NK lymphocyte; 

(d) Monocyte; (e) Macrophage; (f) Neutrophil; and (g) Dendritic cell. 

In a further preferred embodiment, the present invention provides for a method of 
screening test compounds for the ability to bind Antaginin and either inhibit or promote the 
capacity of Antaginin to interfere with CD53 function. Further preferred is a method of screening 

20 said test compounds for the ability to bind Antaginin and either inhibit or promote the capacity of 
Antaginin to interfere with CD53 function as it relates its facilitation of signal transduction through 
beta 1 integrin (Zhang, XA et al., J. Biol. Chem. (2001) which disclosure is hereby incorporated by 
reference in its entirety). Further preferred is a method of screening said test compounds for the 
ability to bind Antaginin and either inhibit or promote the capacity of Antaginin to interfere with 

25 the CD53-facilitated association of protein kinase C with beta 1 integrin. Further preferred is a 
method of screening said test compounds for the ability to bind Antaginin and either inhibit or 
promote the association of protein kinase C with beta 1 integrin in a beta 1 (alpha3betal, 
alpha4betal, or alpha6betal)-expressing cell line transfected with CD53 and Antaginin but not in 
the identical cell line transfected with CD53 alone. Methods of screening said test compounds and 

30 for characterizing their effect on CD53-facilitated association of protein kinase C with beta 1 
integrin are well known to those skilled in the art. 

Preferred formulation of said compound is that selected from, but not restricted to, 
formulations compatible with the routes of delivery selected from the group: (a) Oral; 
(b) Transdermal; (c) Injection; (d) Buccal; and (d) Aerosol. 

35 Compounds found to bind Antaginin and to inhibit the capacity of Antaginin to interfere 

with CD53 function, thereby effectively up-regulating CD53 activity, are used in methods 
analogous to those described above for Antaginin antibody. 
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Compounds found to bind Antaginin and to promote the capacity of Antaginin to interfere 
with CD53 function effectively down-regulate CD53 activity. Such compounds have application 
to chronic inflammatory autoimmune disease and to other disorders of immune dysregulation. 
Such compounds down-regulate CD53-mediated leukocyte activation by promoting the capacity of 

5 Antaginin to antagonize the CD53-facilitated assembly of functional modular signaling complexes 
at the cell surface. In a further embodiment of the invention, said compound is used in a method of 
contacting Antaginin to down-regulate a dysregulated immune response and thereby treat the 
associated immune disorder wherein said immune disorder is selected from, but not restricted to, 
the group: (a) Rheumatoid arthritis; (b) Inflammatory bowel disease; (c) Insulin dependent 

10 diabetes mellitus (Type 1 diabetes); (d) Multiple sclerosis; (e) Systemic lupus erythematosus; 
(f) Psoriasis; (g) Allergic asthma; (h) Allergic rhinitis (hayfever); and (i) Graft versus host disease. 
In a further embodiment of the invention, said test compound having the ability to promote the 
capacity of Antaginin to interfere with CD53 function is used in a method to suppress acute 
inflammation. Said test compounds down-regulate CD53-mediated leukocyte activation by 

1 5 promoting the capacity of Antaginin to antagonize the CD53-facilitated assembly of functional 
modular signalling complexes at the cell surface. Further preferred is a method to use said test 
compound to suppress inflammation associated with wound healing. Further preferred are 
compositions comprised of said test compound used in methods of contacting a wound or injured 
tissue with an ameliorative effective amount by injection or transdermal contact at the site of the 

20 wound. 

Protein of SEQ ID NO:38 (Internal designation Clone 146994JL06-023-4-0-C9-F) 

The cDNA of Clone 146994_106-023-4-0-C9-F (SEQ ID NO:37) encodes the protein of 
SEQ ID NO:38 comprising the amino acid sequence: 
MSPGQPMTFPPEALW\nrVGLSVCLIALL^ 

25 KTALQPLKHSDSKEDDGQEIA. Accordingly it will be appreciated that all characteristics and 
uses of polypeptides of SEQ ID NO:38 described throughout the present application also pertain to 
the polypeptides encoded by the nucleic acids included in Clone 146994_106-023-4-0-C9-F. In 
addition, it will be appreciated that all characteristics and uses of the polynucleotides of SEQ ID 
NO:37 described throughout the present application also pertain to the nucleic acids included in 

30 Clone 146994_106-023-4-0-C9-F. A preferred embodiment of the invention is directed toward the 
compositions of SEQ ID NO:37, SEQ ID NO:38, and Clone 146994J06-023-4-0-C9-F. Also 
preferred are polypeptide fragments having a biological activity as described herein and the 
polynucleotides encoding the fragments. 

The protein of SEQ ID NO:38 encodes Beferin. Beferin is a novel splice variant of two 

35 recently described members of the B lymphocyte activation antigen B7 (BLAA) family, B7-H3 
and Blaa. Beferin has novel function as described below. 

B7-H3 was identified as a human B7-like molecule with T lymphocyte costimulatory 
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activity (Chapoval, AI et al., Nature Immunology 2:269-74 (2001) which disclosure is hereby 
incorporated by reference in its entirety). B7-H3 has the structure: 
[Signal peptide]-[IgV-like domain l]-[IgC-like domain 2]-[transmembrane region]- 
[cytoplasmic tail]. 

5 Blaa (NCBI Accession No. AX097550) was identified as a human B7-like molecule, as 

described in Patent Application WO001 18204 A I ("Polynucleotides encoding members of the 
human B lymphocyte activation antigen B7 family and polypeptides encoded thereby") and 
incorporated by reference in its entirety. Blaa has the structure: 
[Signal peptide]-[IgV-like domain l]-[IgC-like domain l]-[IgV-like domain 2]- 
1 0 [IgC-like domain 2]-[transmembrane region]-[cytoplasmic tail] . 

Blaa (NCBI Accession No. AX047070) was independently identified as a protein with 
beta-secretase (beta-amyloid-converting enzyme) activity, as described in Patent Application 
WO00068266A1 ("Amyloid precursor protein protease and related nucleic acid compositions") 
and incorporated by reference in its entirety. The amino acid sequence of AX947070 is identical to 
15 thatofAX097550. 

IgV-like domain 1 is highly similar, but not identical, to the amino acid sequence of IgV- 
like domain 2. IgC-like domain 1 is highly similar, but not identical, to the amino acid sequence of 
IgC-like domain 2. 

In the case of Beferin, a novel 5' exon is spliced directly onto the exons encoding the 

20 transmembrane region and cytoplasmic tail. This results in the deletion of the IgV-like and IgC- 
like extracellular domains. The short extracellular tail of Beferin is comprised of approximately 
seven amino acids shared with B7-H3 and Blaa preceded by three novel (not found in either B7-H3 
or Blaa) N-terminal amino acids encoded by the novel 5 5 exon (underlined here): MSPGQPMTFP. 
Costimulation, in addition to T cell receptor engagement, is required for optimal activation 

.25 of T cells. The most extensively studied costimulatory molecules are members of the B 
lymphocyte activation antigen B7 family, of which there are presently five. Each B7 family 
member binds to one or more counter-receptor on the T cell, of which there are presently four. B7- 
H3 is highly expressed in many human tissues including heart, liver, placenta, prostate, testis, 
uterus, pancreas, small intestine, and colon. Low expression of B7-H3 was also found in brain, 

30 skeletal muscle, kidney, and lung. B7-H3 is not detectable in peripheral blood mononuclear cells, 
although it can be induced on dendritic cells and monocytes by inflammatory cytokines. Several 
tumor lines also express B7-H3, including those derived from melanoma, cervical 
adenocarcinoma, chronic myelogenous leukemia, lung carcinoma, and colorectal adenocarcinoma. 
B7-H3 costimulates proliferation of both CD4 + and CD8 + T cells, enhances the induction of 

35 cytotoxic T lymphocytes (CTL), and selectively stimulates proinflammatory cytokine interferon 
gamma (IFNgamma) production in the presence of T cell receptor signaling. B7-H3 exists as non- 
covalent oligomers on the antigen-presenting cell, and this is important for high-avidity binding of 
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B7-H3 to its counter-receptor in its role as T cell costimulator. 

In non-neuronal tissue, Blaa cleaves the 751 amino acid isoform of amyloid beta protein 
precursor (APP75 1) at the cell surface by virtue of its beta-secretase activity to generate a soluble 
fragment identical to the serine protease inhibitor protease nexin 2 (PN2). PN2 and its Kunitz 
5 protease inhibitory domain have been shown to be inhibitors of coagulation factor Vila (FVIIa) 
and factor Vila-tissue factor complex (FVEa-TF) (Mahdi, F et aL, Thromb. Res. 99:267-76 (2000) 
which disclosure is hereby incorporated by reference in its entirety) initiators of the extrinsic 
coagulation cascade. TF expression and its engagement of the extrinsic coagulation pathway by 
ovarian cancer cells has been shown to play role in metastasis of the cancer (Fischer, EG et aL, J. 

10 Clin. Invest 104:1213-21 (1999) which disclosure is hereby incorporated by reference in its 
entirety). Factor Xa (FXa) generated by FVHa-TF has been shown to lead to pro-inflammatory 
activation of vascular endothelial cells through its cleavage of protease-activated receptor 2 
(PAR2) (Camerer, E et aL, Proc. Natl. Acad. Sci. USA 97:5255-60 (2000) which disclosure is 
hereby incorporated by reference in its entirety). FXa can also elicit a pro-inflammatory cellular 

15 response by cleavage of protease-activated receptor 1 (PARI) (Kravchenko, RM Blood 97:3 109- 
16 (2001) which disclosure is hereby incorporated by reference in its entirety). 

Beferin interferes with B7-H3 co-stimulation of T lymphocytes through its non-productive 
incorporation into B7-H3 oligomers at the cell surface. One function of Beferin therefore is to 
negatively regulate T lymphocyte co-stimulation, hi a pathological context, Beferin up-regulation 

20 facilitates evasion of immune surveillance by pathogens and tumor cells. 

Beferin interferes with Blaa generation of PN2 through its non-productive interactions with 
APP751. A second functional consequence of Beferin expression is therefore up-regulated 
engagement of the extrinsic coagulation coagulation pathway, including the generation of FXa. In 
a pathological context, Beferin up-regulation facilitates hypercoagulability and cancer metastasis. 

25 In a preferred embodiment, the present invention provides for an antibody that specifically 

binds Beferin of the present invention. Further preferred is a method for making such antibody 
wherein a mouse is immunized with a syngeneic cell line transfected with Beferin. Monoclonal 
antibodies derived from said mouse are screened for binding to the Beferin-transfected cell line but 
not to the identical cell line transfected with human B7-H3 or Blaa. Antibody specificity is further 

30 established through amino acid sequence analysis of immunoprecipitated material. Further 
preferred is a method for making said antibody wherein said antibody specifically binds all or in 
part to the extracellular amino terminus of Beferin. The extracellular amino terminus of Beferin is 
comprises the amino acid sequence 1-10 (numbered from the initiating methionine of Beferin). 
Methods of generating said monoclonal antibody and of establishing its specificity are well known 

35 to those skilled in the art. 

In a preferred embodiment, the present invention provides for a method of contacting said 
antibody and specifically binding it with Beferin. Further preferred is a method for using said 
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antibody diagnostically to determine the basis for an impaired immune response or for 
hypercoagulability. Further preferred is a method of using said antibody diagnostically in a flow 
cytometric analysis of Beferin expression by leukocytes in a pathological context. Further 
preferred is a method of using said antibody diagnostically in an immunohistochemical analysis of 
5 Beferin expression by tissue in a pathological context. Methods of carrying out 

immunohistochemical or flow cytometric analysis are well known to those skilled in the art. 

Further preferred is a method of using said antibody diagnostically in a flow cytometric 
analysis of Beferin expression by normal leukocytes and leukemia cells in the leukemic patient to 
determine the basis either for an impaired anti-tumor immune response or for hypercoagulability 

10 wherein the leukemia is selected from, but not restricted to, the group consisting of: (a) B cell 
acute lymphoblastic leukemia (B-ALL); (b) Chronic lymphocytic leukemia (CLL); (c) T cell acute 
lymphoblastic leukemia (T-ALL); (d) Multiple myeloma; and (f) Acute myeloid leukemia (AML). 

Further preferred is a method of using said antibody diagnostically in a flow cytometric 
analysis of Beferin expression by leukocytes in a patient with viral infection to determine the basis 

1 5 either for an impaired anti-viral immune response or for hypercoagulability wherein the virus is 
selected from, but not restricted to, the group consisting of: (a) Cytomegalovirus; (b) Human 
herpes virus 6 (HHV 6); (c) Human immunodeficiency virus; (d) Hepatitis C virus; and 
(e) Hepatitis D virus. 

Further preferred is a method of using said antibody diagnostically in an 

20 immunohistochemical analysis of Beferin expression by tissue to determine the basis for 

hypercoagulability wherein said tissue is selected from, but not restricted to, the group consisting 
of: (a) Heart; (b) Liver; (c) Placenta; (d) Prostate; (e) Testis; (f) Uterus; (g) Pancreas; (h) Small 
intestine; (i) Colon; (j) Kidney; and (k) Lung. 

Further preferred is a method of using said antibody diagnostically in an 

25 immunohistochemical analysis of Beferin expression by tumor cells to determine the basis either 
for an impaired anti-tumor immune response or for hypercoagulability wherein the tumor cell is 
selected from, but not restricted to, the group consisting of: (a) Melanoma; (b) Breast carcinoma; 
(c) Lung carcinoma; (d) Colon carcinoma; (e) Hodgkin's lymphoma; (f) Non-Hodgkin's 
lymphoma; (g) Prostatic carcinoma; (h) Pancreatic carcinoma; (i) Uterine carcinoma; (j) Ovarian 

30 carcinoma; (k) Testicular carcinoma; (1) Renal carcinoma; (m) Hepatic carcinoma; and (n) Liing 
non-small-cell carcinoma. 

The efficiency of T lymphocyte co-stimulation, as well as coagulability status, can be 
regulated by cytokine. In a further embodiment, the present invention provides for the use of said 
Beferin antibody in in vitro analysis of cytokine regulation of Beferin expression by normal 

35 leukocytes. Further preferred is a method of using said antibody in a flow cytometric analysis of 
said regulation by cytokine wherein the cytokine is selected from, but not restricted to, the group 
consisting of: (a) Interferon gamma; (b) Interleukin 17; (c) Ihterleukin 4; (d) Interleukin 10; 
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(e) Interleukin 13; (f) Interleukin 15; (g) Interleukin 1; (h) Interleukin 6; (i) Monocyte chemotactic 
protein 1 (MCP-1); (j) Vascular endothelial growth factor (VEGF); (k) Transforming growth factor 
beta; (1) Interleukin 8; and (m) Tumor necrosis factor alpha. 

In a further embodiment, the present invention provides for the use of said Beferin 
5 antibody in in vitro analysis of cytokine regulation of Beferin expression by non-leukocytic cell 
lines. Further preferred is a method of using said antibody in a flow cytometric analysis of said 
regulation by cytokine wherein the cytokine is selected from, but not restricted to, the group 
consisting of: (a) Interferon gamma; (b) Interleukin 17; (c) Interleukin 4; (d) Interleukin 10; 
(e) Interleukin 13; (f) Interleukin 15; (g) Interleukin 1; (h) Interleukin 6; (i) Monocyte chemotactic 

10 protein 1 (MCP-1); (j) Vascular endothelial growth factor (VEGF); (k) Transforming growth factor 
beta; (1) Interleukin 8; and (m) Tumor necrosis factor alpha. 

Further preferred is a method of contacting and specifically binding said antibody with 
Beferin and thereby sterically inhibiting the non-productive incorporation of Beferin into B7-H3 
oligomers at the cell surface. In so doing, said Beferin antibody up-regulates B7-H3-mediated T 

15 lymphocyte co-stimulation. Further preferred is a method of contacting and specifically binding 
said antibody with Beferin and thereby sterically interfering with the non-productive interaction of 
Beferin with APP751, thereby un-regulating Blaa-mediated beta secretase cleavage of APP751 to 
generate PN2. As PN2 is an inhibitor of the extrinsic coagulation pathway at the level of FWa- 
TF, this in turn down-regulates coagulability status. Preferred compositions comprise the Beferin 

20 antibody or fragments or derivatives thereof. Preferred route of administration is intravenous 
injection. 

In a further embodiment of the invention, said Beferin antibody is incorporated as an 
adjuvant in vaccine preparations in a method to up-regulate the elicited immune response. In said 
method, said Beferin antibody facilitates the B7-H3-mediated T lymphocyte co-stimulation 

25 contributing to establishment of specific immunity. Said Beferin antibody up-regulates B7-H3- 
mediated T lymphocyte co-stimulation by sterically inhibiting the non-productive incorporation of 
Beferin into B7-H3 oligomers at the cell surface. Further preferred is a method to use said 
antibody in a vaccine targeting a viral infection wherein the virus is selected from, but not 
restricted to, the group consisting of: (a) Human immunodeficiency virus; (b) Human herpes virus 

30 6 (HHV 6); (c) Hepatitis C virus; (d) Hepatitis D virus; (e) Hepatitis E virus; (f) Cytomegalovirus; 
(g) Respiratory syncytial virus; (h) Herpes simplex virus type I; (i) Herpes simplex virus type II; 
(j) Influenza virus; (k) Parvovirus; (m) Coxsachie virus; (n) Echovirus; (o) Epstein-Barr virus; 
(p) Dengue virus; (q) Lassa fever virus; and (r) Ebola virus. 

Further preferred is a method to use said Beferin antibody in a vaccine targeting a 

35 protozoan infection wherein the protozoa is selected from, but not restricted to, the group 
consisting of: (a) Entamoeba histolytica; (b) Cryptosporidium parvum; (c) Plasmodium 
falciparum; (d) Trypanosoma; (e) Leishmania; (f) Trichomonas vaginalis; and (g) Acanthamoeba. 
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Viruses can suppress the immune response as a means of evading immune surveillance. In 
a further embodiment of the invention, said Beferin antibody is used in a method of up-regulating 
the immune response against an ongoing viral infection. In said method, said Beferin antibody 
facilitates the B7-H3-mediated T lymphocyte co-stimulation contributing to the anti-viral immune 
5 response. Said Beferin antibody up-regulates B7-H3 -mediated T lymphocyte co-stimulation by 
sterically inhibiting the non-productive incorporation of Beferin into B7-H3 oligomers at the cell 
surface. Further preferred is a method of up-regulating the immune response against an ongoing 
viral infection wherein the virus is selected from, but not restricted to, the group consisting of: 
(a) Human immunodeficiency virus; (b) Human herpes virus 6 (HHV 6); (c) Hepatitis B virus; 
10 (d) Hepatitis C virus; (e) Hepatitis D virus; (f) Cytomegalovirus; (g) Respiratory syncytial virus; 
(h) Influenza virus; (i) Herpes simplex virus type I; (j) Herpes simlex virus type II; (k) Epstein Ban* 
virus; (1) Varicella zoster virus; (m) Morbillivirus; (n) Parmyxovirus; (o) Papilloma virus; 
(p) Adenovirus; (q) Dengue virus; (r) Lassa fever virus; (s) Coxsachie virus; (t) Echovirus; and 
(u) Ebola virus. 

15 Bacteria can suppress the immune response as a means of evading immune surveillance. 

In a further embodiment of the invention, said Beferin antibody is used in a method of up- 
regulating the immune response against an ongoing bacterial infection. In said method, said 
Beferin antibody facilitates the B7-H3-mediated T lymphocyte co-stimulation contributing to the 
anti-bacterial immune response. Said Beferin antibody up-regulates B7-H3-mediated T 

20 lymphocyte co-stimulation by sterically inhibiting the non-productive incorporation of Beferin into 
B7-H3 oligomers at the cell surface. Further preferred is a method of up-regulating the immune 
response against an ongoing bacterial infection wherein the bacteria is selected from, but not 
restricted to, the group consisting of: (a) Mycobacterium avium complex; (b) Pneumocystis 
carinii; (c) Acne vulgaris; (d) Legionella pneumophilia; (e) Yersinia pestis; (f) Ureaplasma 

25 urealyticum; (g) Chlamydia pneumoniae; (h) Helicobacter pylori; (i) Treponema pallidum; 
(j) Neisseria gonorrhoeae; (k) Salmonella typhimurium; (1) Vibrio cholera; (m) Clostridium 
difficile; (n) Bacillary dysentary; (o) Pencillin resistant Pneumococcus; (p) Burkholderia mallei; 
(q) Mycobacterium leprae; (r) Mycobacterium haemophilum; (s) Mycobacterium kansasii; 
(t) Haemophilus influenzae; and (u) Bacillus anthracis. 

30 Protozoa can suppress the immune response as a means of evading immune surveillance. 

In a further embodiment of the invention, said Beferin antibody is used in a method of up- 
regulating the immune response against an ongoing protozoan infection. In said method, said 
Beferin antibody facilitates the B7-H3-mediated T lymphocyte co-stimulation contributing to the 
anti-protozoan immune response. Said Beferin antibody up-regulates B7-H3 -mediated T 

35 lymphocyte co-stimulation by sterically inhibiting the non-productive incorporation of Beferin into 
B7-H3 oligomers at the cell surface. Further preferred is a method of up-regulating the immune 
response against an ongoing protozoan infection wherein the protozoa is selected from, but not 
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restricted to, the group consisting of: (a) Entamoeba histolytica; (b) Cryptosporidium parvum 
(c) Giardia lamblia; (d) Toxoplasma gondii; (e) Isospora belli; (f) Encephalitozoon cuniculi; 
(g) Enterocytozoon bieneusi; (h) Plasmodium falciparum; (i) Trypanosoma; (j) Leishmania; 
(k) Trichomonas vaginalis; and (1) Acanthamoeba. 
5 In a further embodiment of the invention, said Beferin antibody is used in a method of up- 

regulating the immune response against an ongoing fungal infection wherein the fungus is selected 
from, but not restricted to, the group consisting of: (a) Cryptococcal meningitis; (b) Histoplasma 
capstulatum; (c) Coccidiodes immitis; and (d) Candida albicans. 

Tumors can suppress the immune response as a means of evading immune surveillance. In 

10 a further embodiment of the invention, said Beferin antibody is used in a method of up-regulating 
the immune response against a tumor. In said method, said Beferin antibody facilitates the B7-H3- 
mediated T lymphocyte co-stimulation contributing to the anti-tumor immune response. Said 
Beferin antibody up-regulates B7-H3 -mediated T lymphocyte co-stimulation by sterically 
inhibiting the non-productive incorporation of Beferin into B7-H3 oligomers at the cell surface. 

1 5 Further preferred is a method of up-regulating the immune response against a tumor wherein the 
tumor is selected from, but not restricted to, the group consisting of: (a) Melanoma; (b) Breast 
carcinoma; (c) Lung carcinoma; (d) Colon carcinoma; (e) Prostatic carcinoma; (f) Hodgkin's 
lymphoma; (g) Non-Hodgkin's lymphoma; (h) Pancreatic carcinoma; (i) Uterine carcinoma; 
(j) Ovarian carcinoma; (k) Testicular carcinoma; (1) Renal carcinoma; (m) Hepatic carcinoma; and 

20 (n) Lung non-small-cell carcinoma. 

In a further embodiment of the invention, said Beferin antibody is incorporated as an 
adjuvant in therapeutic anti-tumor vaccines wherein the tumor is selected from, but not restricted 
to, the group consisting of: (a) Melanoma; (b) Breast carcinoma; (c) Lung carcinoma; (d) Colon 
carcinoma; (e) Prostatic, carcinoma; (f) Pancreatic carcinoma; (g) Uterine carcinoma; (h) Ovarian 

25 carcinoma; (i) Testicular carcinoma; (j) Renal carcinoma; (k) Hepatic carcinoma; and (1) Lung 
non-small-cell carcinoma. 

Intracellular (macrophage) pathogens can be eliminated either through macrophage 
activation or through lysis of infected macrophages by cytolytic T lymphocytes (Chun et al., J. 
Exp. Med. 193:1213 (2001) which disclosure is hereby incorporated by reference in its entirety). 

30 Ligation of B7 family members expressed on the macrophage can lead to macrophage activation 
[Hirokawa, M Immunol. Lett. 50:95-8 (1996), which disclosure is hereby incorporated by 
• reference in its entirety]. In a further embodiment of the invention, said Beferin antibody is used in 
a method to eliminate intracellular pathogens by facilitating macrophage activation or cytolytic T 
lymphocyte generation wherein the pathogen is selected from, but not restricted to, the group of 

35 intracellular (macrophage) pathogens consisting of: (a) Histoplasma capsulatum; 

(b) Mycobacterium tuberculosis; (c) Salmonella typhimurium; (d) Chlamydia trachomatis; and 
(e) Pneumocystis carinii. 
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Tumors can engage the extrinsic coagulation pathway through TF expression as a means of 
facilitating metastasis. In a further embodiment of the invention, said Beferin antibody is used in a 
method of down-regulating said tumor engagement of the extrinsic coagulation pathway, hi said 
method, said Beferin antibody facilitates Blaa-mediated beta secretase cleavage of APP751 to 
5 generate PN2, which is an inhibitor of the extrinsic coagulation pathway at the level of FVIIa-TF. 
Said Beferin antibody facilitates Blaa-mediated generation of PN2 by sterically interfering with the 
non-productive interaction of Beferin with APP75 1 . Further preferred is a method of down- 
regulating tumor engagement of the extrinsic coagulation pathway wherein the tumor is selected 
from, but not restricted to, the group consisting of: (a) Melanoma; (b) Breast carcinoma; (c) Lung 
10 carcinoma; (d) Colon carcinoma; (e) Prostatic carcinoma; (f) Hodgkin's lymphoma; (g) Non- 
Hodgkin's lymphoma; (h) Pancreatic carcinoma; (i) Uterine carcinoma; (j) Ovarian carcinoma; 
(k) Testicular carcinoma; (1) Renal carcinoma; (m) Hepatic carcinoma; and (n) Lung non-small- 
cell carcinoma. 

In a further preferred embodiment, the present invention provides for a method of 

1 5 screening test compounds for the ability to bind Beferin and either inhibit or promote the capacity 
of Beferin to interfere with B7-H3 function. Further preferred is a method of screening said test 
compounds for the ability to bind Beferin and either inhibit or promote the capacity of Beferin to 
interfere with B7-H3-mediated T lymphocyte co-stimulation. Further preferred is a method of 
screening said test compounds for the ability to bind Beferin and either inhibit or promote the 

20 capacity of Beferin to interfere B7-H3-mediated T lymphocyte co-stimulation. Further preferred is 
a method of screening said test compounds for the ability to bind Beferin and either inhibit or 
promote B7-H3 -mediated T lymphocyte co-stimulation when the antigen-presenting cell is 
transfected with B7-H3 and Beferin but not when the identical cell is transfected with B7-H3 
alone. Methods of screening said test compounds and for characterizing their effect on B7-H3- 

25 mediated T lymphocyte co-stimulation are well known to those skilled in the art. 

Preferred formulation of said compound is that selected from, but not restricted to, 
formulations compatible with the routes of delivery selected from the group: (a) Oral; 
(b) Transdermal; (c) Injection; (d) Buccal; and (e) Aerosol. 

Compounds found to bind Beferin and to inhibit the capacity of Beferin to interfere with 

30 B7-H3 function, thereby effectively up-regulating B7-H3 activity, are used in methods analogous 
to those described above for Beferin antibody. 

Compounds found to bind Beferin and to promote the capacity of Beferin to interfere with 
B7-H3-mediated T lymphocyte co-stimulation effectively down-regulate B7-H3 activity. Such 
compounds have application to chronic inflammatory autoimmune disease and to other disorders 

35 of immune dysregulation. Such compounds down-regulate B7-H3-mediated T lymphocyte co- 
stimulation by promoting the non-productive incorporation of Beferin into B7-H3 oligomers at the 
cell surface. In a further embodiment of the invention, said compound is used in a method of 
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contacting Beferin to down-regulate a dysregulated immune response and thereby treat the 
associated immune disorder wherein said immune disorder is selected from, but not restricted to, 
the group: (a) Rheumatoid arthritis; (b) Inflammatory bowel disease; (c) Insulin dependent 
diabetes mellitus (Type 1 diabetes); (d) Multiple sclerosis; (e) Systemic lupus erythematosus; 
5 (f) Psoriasis; (g) Allergic asthma; (h) Allergic rhinitis (hayfever); and (i) Graft versus host disease. 
In a further preferred embodiment, the present invention provides for a method of 
screening test compounds for the ability to bind Beferin and inhibit the capacity of Beferin to 
interfere with Blaa function. Further preferred is a method of screening said test compounds for 
the ability to bind Beferin and up-regulate Blaa-mediated PN2 generation through APP75 1 

10 cleavage, thereby down-regulating engagement of the extrinsic coagulation pathway by virtue of 
PN2 being an inhibitor of said pathway. Further preferred is a method of screening said test 
compounds for the ability to bind Beferin and up-regulate Blaa-mediated PN2 generation by 
interfering with the non-productive interaction of Beferin with APP751 . Further preferred is a 
method of screening said test compounds for the ability to bind Beferin and up-regulate PN2 

15 release from an APP75 1 -expressing cell transfected with Beferin and Blaa but not from the 

identical cell line transfected with Blaa alone. Methods of screening said test compounds and for 
measuring the amount PN2 released into the culture medium are well known to those skilled in the 
art. 

Said compounds found to bind Beferin and to effect said down-regulation of the extrinsic 
20 coagulation pathway are used in methods in methods analogous to those described above for 
Beferin antibody. 

Protein of SEQ ID NO:40 (Internal designation Clone 1000838788_228-28-4-0-F7-F) 

The cDNA of Clone 1000838788J228-28-4-0-F7-F (SEQ ID NO:39) encodes the 
Reductase Protein (RP): 
25 MVSGRFYLSCLLLGSLGSMCILFTIYWMQ 
YGGASLVYRLPQSWVGPKLPWKLLH 
HSWLGITTWLFGCQWFLGFAWLLPWA 
LFFSLKNTTRPYHSLPSEAWA^ 

RPGSRPFPVTYVSVTGRQPYKSW (SEQ ID NO:40). Accordingly, it will be appreciated that 
30 all characteristics and uses of the polypeptides of SEQ ID NO:40 described throughout the present 
application also pertain to the polypeptides encoded by the nucleic acids included in Clone 
1000838788_228-28-4-0-F7-F. In addition, it will be appreciated that all characteristics and uses 
of the polynucleotides of SEQ ID NO:39 descried throughout the present application also pertain 
to the nucleic acids included in Clone 1000838788_228-28-4~0-F7-F. A preferred embodiment of 
35 the invention is directed toward the compositions of SEQ ID NO:39, SEQ ID NO:40, and Clone 
100083 8788_228-28-4-0-F7-F. Also preferred are polypeptide fragments having a biological 
activity described herein and the polynucleotides encoding the fragments. 
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RP is a novel member of the cytochrome b561 family of transmembrane electron transfer 
proteins. RP supplies reducing equivalents by catalyzing the transfer of electrons across a 
membrane from a donor to an electron acceptor. This process depends on the interaction of 
histidine residues within the protein and transition metals (usually iron). Also required are 
5 cofactors to act as electron donors and acceptors. Examples of electron donors include but are not 
limited to ascorbic acid, NADH, NADPH, flavins, and reducing polypeptides. Electron acceptors 
include but are not limited to semidehydroascorbic acid, NAD+, NADP+, oxidized flavin species, 
and electron-accepting polypeptide complexes. Therefore, RP requires membrane association, a 
transition metal cofactor, and electron donor/acceptor cofactors for activity. These "required 
10 components of RP activity" will be referred to hereafter as such. 

Preferred embodiments of the invention include: (1) a composition comprising an RP 
polypeptide sequence of SEQ ID NO:40; (2) a composition comprising an RP polypeptide 
fragment having biological activity; (3) a composition comprising a polynucleotide sequence of 
SEQ ID NO:39 encoding an RP polypeptide; (4) a composition comprising a polynucleotide 
15 sequence encoding an RP polypeptide fragment having biological activity. 

A method of reducing oxidized species of iron comprising the step of: contacting an RP 
polypeptide or polynucleotide construct comprising polynucleotides encoding an RP polypeptide 
with iron and a cell. Preferably, ferric iron is reduced to ferrous iron. Preferably, the cell is 
involved in iron-uptake. Further preferably, the cell is derived from duodenal or small intestinal 
20 epithelium. Further preferably, the cell is a brush border enterocyte. 

A method of reducing monooxygenases comprising the step of: contacting an RP 
polypeptide or polynucleotide construct comprising polynucleotides encoding an RP polypeptide 
with a monooxygenase enzyme and a cell. Preferably, the monooxygenase is peptidylglycine 
alpha-amidiating monooxygenase (PAM). Also preferred is the monooxygenase dopamine beta- 
25 hydroxylase (DBH). Preferably, the cell is an endocrine cell. Further preferably,the cell is a 
neuroendocrine cell. 

A method of screening for molecules that bind and/ or inhibit the ability of RP 
polypeptides to transfer electrons comprising the steps: (1) contacting an RP polypeptide with a 
test molecule; (2) detecting test molecule binding to said RP polypeptide; and (3) detecting test 
30 molecule inhibiting of RP polypeptide biological activity. Preferably, a test molecule is 
immobilized on a- semi-solid matrix. 

Also preferred is a test molecule immobilized on a solid matrix. Preferably, a test 
molecule binding to RP polypeptide is detected using fluorescently-labelled RP antibody. 
Preferably, RP biological activity is detected using a common redox assay. Further preferably, RP 
35 biological activity is detected using an MIT reduction assay. Also further preferred is RP 
biological activity detected using an NBT reduction assay. 

A method of inhibiting RP polypeptide-dependent electron transfer comprising the step in 
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contacting an RP polypeptide with an RP polypeptide inhibitor. 

RP polypeptides are capable of transferring electrons to iron species, for example, reducing 
ferric (HI) iron to ferrous (II) iron. Non-heme associated Fe (HI) is highly insoluble in the body, 
while reduced Fe (II) is more readily absorbed. Thus, a method for reducing Fe (HI) to Fe (II) is a 

5 highly desirable treatment for disorders such as hemolytic diseases (e.g., sickle cell anemia), 
hemoglobinopathies, low iron absorption, rheumatoid arthritis, hypoxia, anemias associated with 
pregnancy, end-stage renal failure, cancer chemotherapy, and AIDS (particularly in subjects who 
are being treated with zidovudine (AZT)), and chronic anemia. Furthermore, increased iron uptake 
enables rapid weight gain desired in livestock. In a preferred embodiment of the invention, an 

10 iron-reducing effective amount of RP polypeptides or a polynucleotide construct comprising 

polynucleotides encoding said polypeptide are used in a method to reduce oxidized species of iron. 
This method comprises the step of contacting a RP polypeptide or polynucleotide construct with 
required components of RP activity, iron, and cells. Preferred cells are those involved in iron- 
uptake. Further preferred cells are those of the duodenum and small intestinal epithelium such as 

15 brush border enterocytes [for review, see Siddiqi, S., et al. (2001) Curr. Opin. Gastroenterol. 
17:1 10-7, which disclosure is hereby incorporated by reference in its entirety]. 

RP is expressed in neuroendocrine tissues where it is localized to secretory vesicles. RP 
supplies reducing ability (i.e., electrons) to monooxygenase enzymes, which play a role in 
biosynthesis and processing of catecholamines (e.g., dopamine and norepinephrine) and peptide 

20 hormones (e.g., neuropeptides, gonadotropins, somatotropins, thyrotropins, corticotropins, and 
lactotropins such as vasopressin, oxytocin, and insulin). In a preferred embodiment of the 
invention, a reducing effective amount of RP polypeptides or polynucleotides encoding said 
polypeptides are used in a method to reduce monooxygenases, thereby increasing the activity of 
these enzymes. This method comprises the step of contacting a RP polypeptide or polynucleotide 

25 construct with required components of RP activity, monooxygenase enzymes, and cells. Preferred 
monooxygenase enzymes include but are not limited to peptidylglycine alpha-amidating 
monooxygenase (PAM) and dopamine beta-hydroxylase (DBH). Preferred cells are those that 
express endogenous monooxygenases, such as cells of the adrenal medulla, pituitary gland, and 
other neural and endocrine tissues. 

30 Delivery of RP polypeptide or a polynucleotide construct comprising polynucleotides 

encoding RP polypeptide to cells is accomplished by methods common to the art such as 
transfection, electroporation, or microinjection. Additional methods of contacting said 
polynucleotide construct with cells include but are not limited to: lipid vesicle delivery (including 
micelles, viral envelope components, lipsomes, and modified versions of these) as discussed in 

35 U.S. Patent 6,110,490, U.S. Patent 5,019,369, and P.C.T. 9704748, which disclosures are hereby 
incorporated by reference in their entireties; viral transduction (including attenuated lentiviral and 
adenoviral systems) as discussed in U.S. Patent 6,204,060, which disclosure is hereby incorporated 
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by reference in its entirety; and delivery of naked polynucleotides (preferably to cells of the 
gastrointestinal tract) as discussed in U.S. Patent 6,225,290, which disclosure is hereby 
incorporated by reference in its entirety. 

An example method of delivery comprises steps: i) compressing a polynucleotide 
5 construct, preferably comprising the polynucleotides encoding RP polypeptide operably linked to 
an expression control element (e.g., a CMV promoter to direct constitutive expression), into a lipid 
vesicle derived from any of the following list: viral envelopes, liposomes, micelles, gangliosides 
and modified versions of these, preferably GM-1 ganglioside and phosphatidylserine, as described 
in U.S. Patent 6,180,603, U.S. Patent 6,1 10,490 or P.C.T. 9704748, which disclosures are hereby 

1 0 incorporated by reference in their entireties; ii) targeting the lipid vesicle to specific cells, for 
example, by embedding a targeting moiety into the lipid envelope (e.g., growth hormone 
secretagogue for pituitary localization); iii) contacting the targeted vesicle with specific cells by 
methods common to the art such as injection or inhalant (U.S. Patent 6,1 10,490, P.C.T. 9704748, 
and U.S. Patent 6,180,603, which disclosures are hereby incorporated by reference in their 

15 entireties). 

In an additional example of delivery, a polynucleotide construct comprising 
polynucleotides encoding the RP polypeptide operably linked to an expression control element 
(e.g., a CMV promoter to direct constitutive expression or a brush border-specific promoter such as 
the sucrase promoter) is delivered orally (e.g., in a physiologically-acceptable liquid, slurry, syrup, 

20 paste, powder, pill, or capsule form) to increase iron absorption by brush border enterocytes in the 
duodenum. Said naked polynucleotide construct may be modified to specifically target certain 
cells of the intestine, for example, by adding an oligosaccharide modification specific for brush 
border cell lectins (e.g., wheat germ agglutinin). Said naked polynucleotide construct may further 
provide for site-specific integration into the genome of the target intestinal cell. For example, said 

25 construct can be modified such that polynucleotides encoding RP polypeptide and an operably 
linked promoter to are flanked by the position-specific integration markers of Sacchoromyces 
cerevisiae Ty3 (U.S. Patent 5,292,662, which disclosures are hereby incorporated by reference in 
their entirety). 

Further included in the present invention are methods of inhibiting the above RP activities using an 
30 inhibitor of RP. Thus, a preferred embodiment of the present invention is a method of inhibiting 
RP polypeptide-dependent electron transfer (including reduction of ferric iron to ferrous iron and 
reduction of monooxygenase enzymes) by contacting RP polypeptides with RP polypeptide 
inhibitors. A further embodiment of the invention is a method of screening for compounds that 
bind and/ or inhibit the ability of RP polypeptides to transfer electrons. This method comprises the 
35 steps of: i) contacting an RP polypeptide with a test compound; and ii) detecting whether said test 
compound binds and/ or inhibits RP polypeptide reducing activity. Detection of RP polypeptide 
binding is accomplished by methods common to the art (e.g., by immobilizating said test 
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compound on a solid or semi-solid matrix and detecting RP polypeptides by fluorescently-labelled 
RP antibody). Inhibition of RP polypeptide reducing activity is measured using common assays to 
detect redox and electron transfer activity, such as MTT reduction (Chakrabarti, R., et al. (2000) J. 
Cell Biochem. 1 8:133-8, which disclosure is hereby incorporated by reference in its entirety) or 
5 NBT reduction [Meerhof, L. and Roos, D. (1986) J. Leukoc. Biol. 39:699-71 1, which disclosure is 
hereby incorporated by reference in its entirety]. 

Protein of SEQ ID NO:42 (Internal designation Clone 1000943975_160-213-2-0-A5-F) 

The cDNA of Clone 1000943975_160-213-2-0-A5-F (SEQ ID NO:41) encodes the Small 
Secreted Serine Protease Inhibitor (SSSPI) comprising the amino acid sequence: 

10 MPACRLGPLAAALLLSLLLFG 

CCSAGCATFCSLPM3KEGSCPQVMNFPQLGLCRDQCQVDSQCPGQMKCCRNGCG 
VTPNF (SEQ ID NO:42). Accordingly, it will be appreciated that all characteristics and uses of 
the polypeptides of SEQ ID NO:42 described throughout the present application also pertain to the 
polypeptides encoded by the nucleic acids included in Clone 1000943975_160-213-2-0-A5-F. In 

15 addition, it will be appreciated that all characteristics and uses of the polynucleotides of SEQ ID 
NO:41 described throughout the present application also pertain to the nucleic acids included in 
Clone 1000943975_160-213-2-0-A5-F. A preferred embodiment of the invention is directed 
toward the compositions of SEQ ID NO:41, SEQ ID NO:42, and Clone 1000943975_l60-213-2-0- 
A5-F. Also preferred are polypeptide fragments having a biological activity as described herein 

20 and the polynucleotides encoding said fragments. 

The Small Secreted Serine Protease Inhibitor (SSSPI) includes two WAP (whey acidic 
protein)/ four-disulfide core domains, which are commonly found in serine protease inhibitors. 
• SSSPI is extremely stable due to the presence of extensive intramolecular disulfide bonds. The 
biological activity of SSSPI is to inhibit protein degradation by serine proteases determined, for 

25 instance, by tracking protein degradation by methods common to the art (e.g., Coomassie Blue 
stain). Furthermore, SSSPI activity is associated with retarding growth in tissues that include 
smooth muscle, colon, ovarian, and mammary tissues. 

In a preferred embodiment of the invention, SSSPI polypeptides or fragments thereof are 
used to screen libraries of compounds for formation of binding complexes between SSSPI 

30 polypeptide and the agent being tested. The fragment employed in such screening may be free in 
solution, affixed to a solid support, borne on a cell surface, or located intracellularly. The 
formation of binding complexes is measured by methods known in the art (e.g., fluorescent 
labeling or green fluorescent protein tagging of the test agent, SSSPI polypeptides, or antibodies 
against either). A preferred method for screening provides for high throughput screening of 

35 compounds having suitable binding affinity to SSSPI polypeptide. An example of this method 
comprises the steps: i) synthesizing large numbers of different small test compounds onto a solid 
substrate, such as plastic pins; ii) reacting test compounds with SSSPI polypeptides and washed; 
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iii) detecting bound SSSPI polypeptides by methods known in the art. Alternatively, SSSPI 
polypeptides are coated directly onto plates or immobilized using non-neutralizing antibodies and 
used in the aforementioned screening techniques. This method is applied, for example, to 
detecting protease levels in a test solution or to screening for molecules that interact with SSSPI as 
5 discussed in the following embodiment. In another embodiment of the invention, binding 

complexes of SSSPI polypeptide and the aforementioned test agents are used in a method to screen 
for compounds that inhibit interaction of SSSPI polypeptide with serine protease substrates. This 
method comprises the steps: i) allowing SSSPI polypeptide-test agent binding complex to form; 
ii) adding SSSPI substrate (such as elastase); iii) measuring SSSPI binding to substrate directly or 

10 indirectly by methods common in the art (e.g., fluorescent labeling of the substrate molecule or of 
an antibody against said substrate). This method is applied, for example, to screening for 
molecules that inhibit SSSPI biological activity. 

In a preferred embodiment of the invention, a method of inhibiting protein degradation 
with a biologically active SSSPI polypeptide or a polynucleotide construct comprising 

15 polynucleotides encoding said polypeptide is provided. This method comprises the step of 

contacting a protein degradation-inhibiting effective amount of SSSPI polypeptide with proteins in 
a solution of appropriate pH and salt concentration to allow SSSPI biological activity (e.g., 
buffered saline). In an additional embodiment, SSSPI polypeptide is combined with other protease 
inhibitors and used in a method to inhibit protein degradation. This method comprises the steps: 

20 combining a protein degradation-inhibiting effective amount of SSSPI polypeptide with effective 
amounts of other protease inhibitors to form a protease inhibitor cocktail and contacting said 
cocktail with proteins in a solution of appropriate pH and salt concentration to allow SSSPI 
biological activity. Preferred protease inhibitors are of a different specificity than SSSPI to 
maximize the protease-inhibiting effectiveness of the cocktail, such as Kunitz-, trypsin inhibitor- 

25 like cystine-rich domain (TIL)-, thyroglobulin-, Kazal-, and netrin (NTR)- type protease inhibitors. 
Biologically acceptable salts of the SSSPI polypeptide also fall within the scope of the 
invention. The term "biologically acceptable salts" as used herein means an inorganic acid 
addition salt such as hydrochloride, sulfate, and phosphate, or an organic acid addition salt such as 
acetate, maleate, fumarate, tartrate, and citrate. Examples of biologically acceptable metal salts are 

30 alkali metal salts such as sodium salt and potassium salt, alkaline earth metal salts such as 
magnesium salt and calcium salt, aluminum salt, and zinc salt. Examples of biologically 
acceptable organic amine addition salts are salts with morpholine and piperidine. Examples of 
biologically acceptable amino acid addition salts are salts with lysine, glycine, and phenylalanine. 
Compounds provided herein can be formulated into "physiologically acceptable 

35 compositions" by admixture with physiologically acceptable nontoxic excipients and carriers. 

Such compositions may be prepared for use in parenteral administration, particularly in the form of 
liquid solutions or suspensions; oral administration, particularly in the form of tablets or capsules; 
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intranasally, particularly in the form of powders, nasal drops, or aerosols; dermally, via, for 
example, transdermal patches; or prepared in other suitable fashions for these and other forms of 
administration as will be apparent to those skilled in the art. 

Common excipients include, for example, sterile water or saline, polyalkylene glycols such 

5 as polyethylene glycol, oils of vegetable origin, and hydrogenated naphthalenes. Further excipient 
formulations include but are not limited to lactose, polyoxyethylene-9-lauryl ether, glycocholate, 
deoxycholate, salicylate, citric acid, oily or gel-like solutions and lipophilic emulsions. Potentially 
useful parenteral delivery systems for these active compounds include ethylene-vinyl acetate 
copolymer particles, osmotic pumps, implantable infusion systems, and liposomes. The invention 

10 can be employed as the sole active agent or can be used in combination with other active 
ingredients which could facilitate inhibition of serine proteases. 

Protease activity is associated with tumor formation by mechanisms that include 
proteolytic processing of growth factors (e.g., insulin-like growth factor, fibroblast growth factor 
(FGF), epidermal growth factor (EGF), heparin-binding epidermal growth factor-like growth 

15 factor, tumor necrosis factor (TNF)-alpha, and transforming growth factor (TGF)-beta). Indeed, 
SSSPI is capable of inhibiting proliferation of prostate carcinoma cells and pulmonary artery 
smooth muscle by preventing proteolytic processing of insulin-like growth factor II and FGF, 
respectively. In a preferred embodiment of the invention, a protein degradation-inhibiting 
effective amount of SSSPI polypeptide is contacted with cells to inhibit proteolytic processing and 

20 degradation of proteins. Preferred cells are those expressing growth factors that require proteolytic 
processing to promote proliferation, such as those listed above. Examples of preferred cells 
include those from the lung, gastrointestinal tract, liver, skin, mammary gland, pancreas, ovary, 
prostate gland, and vascular smooth muscle and epithelia. This method comprises the step of 
contacting a physiologically acceptable composition of SSSPI polypeptide with cells. Delivery of 

25 said composition to cells is accomplished as discussed above, as determined appropriate by one 
skilled in the art. 

An additional embodiment of the invention provides a method of introducing a 
polynucleotide construct comprising polynucleotides encoding SSSPI polypeptides to cells to 
inhibit proteolytic processing and degradation of proteins. Preferred cells are those expressing 

30 growth factors that require proteolytic processing to promote proliferation (e.g., insulin-like growth 
factor, FGF, EGF, heparin-binding epidermal growth factor-like growth factor, TNF-alpha, and 
TGF-beta) or cells that contact said cells. Examples of preferred cells include those from the lung, 
gastrointestinal tract, liver, skin, mammary gland, pancreas, ovary, prostate gland, and vascular 
smooth muscle and epithelia. Preferred polynucleotide constructs comprise polynucleotides 

35 encoding SSSPI polypeptide operably linked to an expression control element such as a promoter. 
Preferred expression control elements direct expression of SSSPI polypeptide in amount effective 
to inhibit protein degradation. Examples include the CMV promoter for constitutive expression or 
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a tissue-specific promoter, such as the human glandular kallikrein-2 promoter for expression in 
androgen receptor-positive prostate cancer cells. A physiologically acceptable composition 
comprising the polynucleotide construct is introduced to cells using methods common to the art 
such as electroporation or transfection. Additional delivery methods of said physiologically 
5 acceptable composition include but are not limited to: lipid vesicle delivery (including micelles, 
viral envelope components, lipsomes, and modified versions of these) as discussed in U.S. Patent 
61 10490, U.S. Patent 5019369, and P.C.T. 9704748, which disclosures are hereby incorporated by 
reference in their entireties; viral transduction (including attenuated lentiviral and adenoviral 
systems) as discussed in U.S. Patent 6204060, which disclosure is hereby incorporated by 

10 reference in its entirety; and delivery of a physiologically acceptable composition comprising 
naked polynucleotides (for example, to cells of the gastrointestinal tract) as discussed in U.S. 
Patent 6225290, which disclosure is hereby incorporated by reference in its entirety. 

SSSPI is capable of inhibiting serine proteases implicated in degenerative disorders 
including but not limited to thrombin, human leukocyte elastase, pancreatic elastase, trypsin, 

15 chymase, and cathepsin G. Thrombin is produced in the blood coagulation cascade and is 

implicated disorders such as thrombophlebitis, thrombosis, other bleeding disorders, and asthma. 
Human leukocyte elastase is implicated in tissue degenerative disorders such as rheumatoid 
arthritis, osteoarthritis, atherosclerosis, bronchitis, cystic fibrosis, and emphysema. Pancreatic 
elastase and trypsin are implicated soft tissue degradation, particularly in cases of pancreatitis. 

20 Chymase, an enzyme important in angiotensin synthesis, is implicated in disorders such as 
hypertension, myocardial infarction, and coronary heart disease. Cathepsin G is implicated in 
abnormal connective tissue degradation, particularly in the lung. In the extreme, serine proteases 
including but not limited to those mentioned above, kallikrein, and prostate specific antigen (PSA) 
are involved in tumor formation through proteolytic remodeling of extracellular matrix (ECM) 

25 proteins. This proteolytic remodeling may result in disruption of the integrity of tissue epithelial 
lining and basement membranes and result in metastasis, hi a preferred embodiment of the 
invention, a protein degradation-inhibiting effective amount of SSSPI polypeptides are applied to 
cells to inhibit protein degradation and resulting tissue or ECM degeneration. This method 
comprises the step of contacting a physiologically acceptable composition comprising SSSPI 

30 polypeptides with cells. Preferred cells include those diagnosed or at risk of degenerative disorders 
as a result of serine protease activity, such as those lung, gastrointestinal tract, liver,* skin, 
mammary gland, pancreas, ovary, prostate gland, bone and cartilage, and vascular smooth muscle 
and epithelia. Further preferred cells include those diagnosed or at risk of tumor invasion as a 
result of serine protease activity such as those involved in formation of epithelial linings, basement 

35 membranes, and ECM (e.g., epithelial cells and fibroblasts). Delivery of said composition to cells 
is accomplished as discussed above, as determined appropriate by one skilled in the art. 

In a further embodiment of the invention, SSSPI polypeptides or fragments thereof are 
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used in a method to detect serine proteases. This method is directed toward diagnosis of the 
aforementioned disorders and diseases. An example of this method comprises the steps of 
contacting SSSPI polypeptides with a biological fluids (e.g., cell culture media, blood, serum, cell 
suspensions or samples) suspected of containing serine proteases, washing, and detecting serine 

5 protease-SSSPI complexes. Detection of said complexes is accomplished by methods common to 
the art such as competition with a fluorescently-labeled neutralizing antibody. 
Protein of SEQ ID NO:44 (Internal designation Clone 147441 _106-025-2-0-Cll-F) 
The cDNA of Clone 147441 J06-025-2-0-C1 1-F (SEQ ID NO:43) encodes the 
CarboxyPeptidase Inhibitor-1 (CPI-1): 

10 MQGTPGGGTRPGPSPVDRRTLLWSF^ 

GLKPQKVDFWRGPARPSLPVDMRVPFSELKD (SEQ ID NO:44). Accordingly, it will be 
appreciated that all characteristics and uses of the polypeptides of SEQ ID NO:44 described 
throughout the present application also pertain to the polypeptides encoded by the nucleic acids 
included in Clone 14744 l_106-025-2-0-Cll-F. In addition, it will be appreciated that all 

1 5 characteristics and uses of the polynucleotides of SEQ ID NO:43 described throughout the present 
application also pertain to the nucleic acids included in Clone 14744 l_106-025-2-0-Cl 1-F. A 
preferred embodiment of the invention is directed toward the compositions of SEQ ID NO:43, 
SEQ ID NO:44, and Clone 147441J06-025-2-0-C1 1-F. Also preferred are polypeptide fragments 
. having a biological activity as described herein and the polynucleotides encoding the fragments. 

20 CPI-1 is a 91 amino acid protein that is highly homologous to the amino-terminal "prepro" 

region of preprocarboxypeptidase. The "pre" region represents a signal peptide while the "pro" 
region inhibits carboxypeptidase enzyme activity by binding to the active site of the enzyme before . 
being proteolytically removed. Proteolytic cleavage of procarboxypeptidase results in formation of 
mature, active carboxypeptidase. Proteolytic processing of procarboxypeptidase (e.g., by trypsin) 

25 relies on the carboxy-terminus of the "pro" region, which is absent in CPI-1 . CPI-1 therefore acts 
. as a small, independent inhibitor of carboxypeptidase activity that is not recognized by 
carboxypeptidase-specific proteases. Carboxypeptidases comprise a family of proteins that 
function in many physiological processes. These proteins remove a wide range of carboxyl- 
terminal amino acids, and in doing so are able to activate, inactivate, and modulate enzyme and 

,30 peptide hormone activity, as well as participate in peptide degradation and amino acid absorption. 
Active forms of mammalian carboxypeptidases may be secreted or located in lysosomes where 
they regulate intracellular protein processing, degradation and turnover. The "biological activity" 
of CPI-1 polypeptide is defined as the ability to inhibit carboxypeptidase activity. 
Carboxypeptidase activity may be measured by methods common to the art, such as incubation of 

35 a test sample with a radiolabeled Bolton-Hunter reagent-coupled peptide substrate (Normant, E., et 
al. (1995) Proc. Natl. Acad. Sci. 92:12225-9). "Carboxypeptidase" is used herein to refer to any 
member of the carboxypeptidase family. 
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Preferred embodiments of the present invention include: (1) a composition, comprising a 
CPI-1 polypeptide sequence of SEQ ID NO:44; (2) a composition, comprising a CPI-1 polypeptide 
fragment having a carboxypeptidase-inhibiting biological activity; (3) a composition, comprising a 
polynucleotide sequence of SEQ ID NO:43 encoding a CPI-1 polypeptide; (4) a composition, 
5 comprising a polynucleotide sequence encoding a carboxypeptidase-inhibiting biologically active 
CPI-1 polypeptide fragment. 

A method of inhibiting carboxypeptidase-mediated anti-fibrinolytic activity, comprising 
the step of: contacting an effective amount of a CPI-1 polypeptide or biologically active fragment 
thereof with carboxypeptidase in the bloodstream of an individual. Further preferably, CPI-1 
10 polypeptide is delivered to a human. 

A method of preventing or inhibiting the progression of carboxypeptidase-mediated 
pancreatitis, comprising the step of: contacting a CPI-1 polypeptide or biologically active 
fragment thereof with a pancreatic cell. 

A method of preventing or inhibiting the progression of carboxypeptidase-mediated 
15 pancreatic cancer, comprising the step of: contacting a CPI-1 polypeptide or biologically active 
fragment thereof with a pancreatic cell. 

A method of preventing or inhibiting the progression of carboxypeptidase-mediated lung 
cancer, comprising the step of: contacting a CPI-1 polypeptide or biologically active fragment 
thereof with a lung cell. 

20 A method of preventing or inhibiting the progression of carboxypeptidase-mediated 

ovarian cancer, comprising the step of: contacting a CPI-1 polypeptide or biologically active 
fragment thereof with an ovarian cell. 

A method of preventing or inhibiting the progression of carboxypeptidase-mediated larynx 
cancer, comprising the step of: contacting a CPI-1 polypeptide or biologically active fragment 
25 thereof with a larynx cell. 

A method of preventing or inhibiting the progression of carboxypeptidase-mediated uterine 
cancer, comprising the step of: contacting a CPI-1 polypeptide or biologically active fragment 
thereof with a uterine cell. 

A method of preventing or inhibiting the progression of carboxypeptidase-mediated 
30 hepatic cancer, comprising the step of: contacting a CPI-1 polypeptide or biologically active 
fragment thereof with a hepatic cell. 

A method of binding an antibody or antibody fragment to a CPI-1 polypeptide comprising 
the step of: contacting said antibody or antibody fragment with a biological sample. 

A method of using an antibody or antibody fragment that specifically binds CPI-1 
35 polypeptides or fragments thereof in a detection assay comprising the steps of: contacting said 
antibody or antibody fragment with a biological sample; and detecting antibody or antibody 
fragment binding to said sample. 
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A further preferred method comprises the additional step of: contacting a second antibody, 
or antibody fragment, that does not bind CPI-1 polypeptides or fragments thereof with said 
biological sample. 

Further preferably, the first and/ or second antibodies or antibody fragments are modified 
5 with detectable molecular tags. 

Further preferably, the biological sample is a blood sample or a tissue sample. 
Further preferably, the detection assay is used for purposes of diagnosis. 
A method of using an antibody or antibody fragment that binds CPI-1 polypeptides or 
fragments thereof to inhibit CPI-1 biological activity and facilitate carboxypeptidase activity, 
10 comprising the step of: contacting said antibody or antibody fragment with CPI-1 polypeptides or 
biologically active fragments thereof. 

The coagulation and fibrinolytic pathways are balanced to produce blood clotting and clot 
degradation, respectively, at appropriate times. Carboxypeptidase activity is anti-fibrinolytic, i.e., 
carboxypeptidase abrogates clot degradation, most likely by inhibiting plasminogen activation. In 
15 a preferred embodiment of the invention, a carboxypeptidase-inhibiting effective amount of a CPI- 
1 polypeptide, fragment thereof, or a polynucleotide encoding said polypeptide is used to inhibit 
carboxypeptidase-mediated blood clot formation and retention. This method may be directed 
toward facilitating anti-coagulant activity as desired in cases such as immobilization, 
thrombophilia, hereditary thrombophilia, stroke, myocardial infarction, coronary artery disease, 
20 malignant conditions, during and after surgical procedures, and in cases of increased risk of blood 
clots associated with medications. Preferably, this method is directed toward treatment of these 
conditions in a human. This method comprises the step of contacting a CPI-1 polypeptide or a 
biologically active fragment thereof with carboxypeptidase by administering a CPI-1 polypeptide 
to and individual. A preferred method of delivering CPI-1 polypeptides or biologically active 
25 fragments thereof to an individual includes direct, intravenous injection of said polypeptides or 
fragments in a physiologically acceptable solution (e.g., pH-buffered isotonic saline solutions, pH- 
buffered isotonic saline solutions modified by addition of viscous elements such as glycerol). 

An additional preferred method of delivering CPI-1 polypeptides or fragments to an 
individual comprises the step of introducing a polynucleotide construct comprising polynucleotides 
30 encoding CPI-1 polypeptides or biologically active fragments thereof into a cell. Preferred cells 
are those lining the bloodstream, such as vascular endothelial cells, vascular smooth muscle cells, 
and fibroblasts. Additional preferred cells are those that travel through the bloodstream, such as 
hematopoetic cells and their precursors, lymphocytes, macrophages, eosinophils, neutrophils, and 
red blood cells. Preferred polynucleotide constructs comprise an expression control element 
35 operably linked to polynucleotides encoding a CPI-1 polypeptide or biologically active fragment 
thereof. Examples of commercially available expression control units include but are not limited 
to a CMV promoter for constitutive expression or a tetracycline-repressible promoter for regulated 
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expression. Said polynucleotide construct is delivered to the cell by methods determined 
appropriate for the cell type. Delivery to cells that travel through the bloodstream may be 
accomplished by methods common to the art such as transfection or electroporation. Cells 
carrying the polynucleotide construct are then introduced to the bloodstream by, for instance, 
5 injection. Delivery to cells that line the bloodstream may be accomplished by methods including 
but not limited to lipid vesicles or viral transduction, as described in any one of the list: U.S. Patent 
5616565, U.S. Patent 6110490, U.S. Patent 6204060, andP.C.T. 9704748 which disclosures are 
hereby incorporated by reference in their entireties. Lipid vesicles may be derived from elements 
including but not limited to: viral envelopes, liposomes, micelles, and modified versions of these, 

10 as described in U.S. Patent 61 10490 or P.C.T. 9704748, which disclosures are hereby incorporated 
by reference in their entireties. Lipid vesicles or viruses may further be targeted to specific cells, 
for example, by embedding a member of a receptor-receptor ligand pair into the lipid envelope 
(e.g., VEGF/VEGFR for targeting to vascular endothelial cells). 

While carboxypeptidase activity is required for normal protein processing in the pancreas, 

15 higher than normal levels of activity lead to pancreatitis, or destruction and inflammation of the 
pancreas. Pancreatitis often leads to pancreatic cancer. Carboxypeptidase is active in the 
extracellular space of the pancreas as well as in vacuolar compartments such as lysosomes. In a 
preferred embodiment of the invention, a carboxypeptidase-inhibiting effective amount of CPI-1 
polypeptides, biologically active fragments thereof, or polynucleotides encoding said polypeptides 

20 are used to prevent or inhibit progression of pancreatitis or pancreatic cancer. This method 
comprises the step of contacting a physiologically acceptable solution comprising a CPI-1 
polypeptide or biologically active fragment thereof with a pancreatic cell. Said polypeptides may 
be delivered, for example, by implanting a CPI-1 polypeptide- releasing stent surgically or via 
catheter (U.S. Patent 5,500,013 and U.S. Patent 5,449,382, which disclosures are hereby 

25 incorporated by reference in their entireties). Polypeptides may further be delivered by direct 
injection (catheter or syringe) into the pancreatic organ. A further preferred method of delivering 
CPI-1 polypeptides or biologically active fragments thereof includes introducing a polynucleotide 
construct comprising polynucleotides encoding said polypeptides into a pancreatic cell. This 
method has the advantage of contacting CPI-1 polypeptides with intracellular compartments of 

30 carboxypeptidase activity. Said polynucleotide construct may further include an expression 
control element operably linked to polynucleotides encoding CPI-1 polypeptides or biologically 
active fragments thereof. Said polynucleotide construct may be delivered to a pancreatic cell by 
methods including but not limited to lipid vesicles or viral transduction, as described in any one of 
the list: U.S. Patent 5,616,565, U.S. Patent 6,110,490, U.S. Patent 6,204,060, and P.C.T. 9704748 

35 which disclosures are hereby incorporated by reference in their entireties. Lipid vesicles may be 
derived from elements including but not limited to the following list: viral envelopes, liposomes, 
micelles, and modified versions of these, as described in U.S. Patent 6,1 10,490 or P.C.T. 9704748, 
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which disclosures are hereby incorporated by reference in their entireties. Lipid vesicles or viruses 
may further be targeted to specific cells, for example, by embedding a member of a receptor- 
receptor ligand pair into the lipid envelope. 

Aside from pancreatic cancer, higher than normal levels of carboxypeptidase activity are 
5 found in cancers that include: lung, ovary, larynx, uterus, liver, stomach, and breast cancers. 
Carboxypeptidase activity leads to an increase in inflammatory cytokines, such as Tumor Necrosis 
Factor (TNF)-alpha. Therefore, carboxypeptidase-mediated tumorigenesis results from 
inflammation and destruction in a number of tissue types. As a preferred embodiment of the 
invention, a carboxypeptidase-inhibiting effective amount of CPI-1 polypeptides, biologically 

10 active fragments thereof, or polynucleotides encoding said polypeptides are used to prevent or 
inhibit progression of cancers. Preferred cancers include those listed above. This method 
comprises the step of contacting a physiologically acceptable solution comprising a CPI-1 
polypeptide or biologically active fragment thereof with a cell. Preferred cells include those of the 
lung, ovary, larynx, uterus, liver, stomach, and breast. Further preferred cells are those at risk of or 

15 displaying cancerous or precancerous pathology as is commonly determined by those skilled in the 
art (e.g., loss of contact inhibition, abnormal cell size or shape). CPI-1 polypeptides, biologically 
active fragments thereof, or polynucleotides encoding said polypeptides are delivered to a specific 
cell by methods common to the art such as those discussed herein. 

In an additional embodiment of the invention, CPI-1 polypeptides or fragments thereof are 

20 used to generate antibodies (or antibody fragments) that specifically bind to CPI-1 polypeptides or 
fragments thereof (detAbs for "detection antibodies") and/ or inhibit the biological activity of CPI- 
1 polypeptides or fragments thereof (inhAbs for "inhibitory antibodies"). Antibodies may be 
polyclonal or monoclonal and may be generated by any method known to one skilled in the art. 
In a preferred embodiment of the invention, antibodies or antibody fragments that 

25 specifically bind and inhibit CPI-1 biological activity (inhAbs) are used to facilitate 

carboxypeptidase activity. This method may be directed toward increasing carboxypeptidase- 
mediated anti-fibrinolytic activity for example, to prevent or treat bleeding disorders. This method 
may alternatively be directed toward increasing carboxypeptidase-mediated uptake of low density 
lipoprotein (LDL) particles by macrophages for example, to prevent or treat high blood pressure or 

30 atherosclerosis. This method comprises the step of contacting inhAbs with CPI-1. A preferred 
method of contact includes injection of a physiologically acceptable solution comprising inhAbs to 
the bloodstream of an individual at risk of or suffering from a bleeding disorder or high LDL 
levels. 

In a further preferred embodiment of the invention, antibodies or antibody fragments that 
35 bind CPI-1 polypeptides or fragments thereof (detAbs) are used in assays to bind and/ or detect 
CPI-1 polypeptides or fragments thereof. This method may be directed toward in vitro uses such 
as purification of CPI-1 or carboxypeptidase polypeptides for drug development. An example of 
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this method comprises the steps of: immobilizing a detAb on a solid or semi-solid matrix (e.g., 
sepharose); and exposing said immobilized detAb with a biological solution comprising proteins, 
preferably CPI-1 polypeptides or fragments thereof. This method may further be directed toward 
diagnosis of pancreatitis, pancreatic cancer, LDL-mediated disorders, and clotting disorders such 
5 as hemophilia, thrombophilia, hereditary thrombophilia, stroke, myocardial infarction, coronary 
artery disease, malignant conditions, and blood clots. This method comprises the steps of: 
contacting a detAb, preferably a detectably-labeled detAb (e.g., conjugated to a fluorescent tag), 
with a biological sample, preferably a tissue or blood sample; and detecting detAb binding to said 
sample. A further step of contacting a second antibody or antibody fragment that does not bind 

10 CPI-1 polypeptides or fragments thereof may be added to determine the specific nature of the 
protein detected by the first antibody or antibody fragment. The second antibody or antibody 
fragment is preferably labeled with a detectable molecular tag such as a fluorescent molecule. 
Further preferably, a different molecular tag than that used by the first antibody or antibody 
fragment is used with the second antibody or antibody fragment. 

15 Protein of SEQ ID No:46 (Internal designation Clone 124610_113«003-3-0-H5-F) 

The polypeptides of SEQ ID NO:46 are encoded by the polynucleotides of SEQ ID NO:45 
of Clone 124610_1 13-003-3-0-H5-F. It will be appreciated that all characteristics and uses of the 
polynucleotides of SEQ ID NO:45 and polypeptides of SEQ ID NO:46, described throughout the 
present application also pertain to the human cDNA of Clone 1246KM13-003-3-0-H5-F and the 

20 polypeptides encoded thereby. The gene of SEQ ID NO:45 is located on chromosome 17, encodes 
a human retinoic acid-inducible regulator of growth arrest and differentiation and is hereby 
referred to as RET-A-MODULIN comprising the polypeptide 

MTPSEGARAGTGRELEMLDSLLALG 
QVHILGCEVSEEEFREGFDSDINNRL^ 

25 DPWVTIALDSLSWLLLRLPCTTLCQVLHAVSHQDSCPGDSSSVGK^ 
PVGALSSLAQTEVTLGGTMGQASAHILCRRPRQRPTDQTQWFSILPD^ 
PYSDPHffPVSKNAKARTRKCSLVSGH^ A preferred embodiment 

of the invention is directed toward the compositions of SEQ ID NO:45, SEQ ID NO:46, and Clone 
124610_1 13-003 -3 -0-H5-F. Also preferred are polypeptide fragments having a biological activity 

30 as described herein and the polynucleotides encoding the fragments. 

A preferred embodiment of the invention is directed towards using compositions 
comprising RET-A-MODULIN and other preferred compositions in a method for inhibiting 
neoplastic cell growth, killing neoplastic cells and treating cancer. More particularly, the invention 
concerns methods and compositions to inhibit cellular proliferation of neoplastic cells, induce 

35 cytotoxicity in neoplastic cells and kill neoplastic cells (e.g., carcinomas, melanoma, and lymphoid 
tumors such as acute myelocytic leukemia (AML)), wherein said methods comprises contacting 
cells with a proliferation-inhibiting amount of RET-A-MODULIN or other sequences of the 
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invention. The method of suppressing neoplastic cell growth comprises the effects selected from 
the group consisting of: (a) inhibiting cell growth or proliferation; (b) killing said neoplastic cells; 
(c) inducing apoptosis in said neoplastic cells; (d) inducing necrosis in said neoplastic cells; 
(e) preventing or inhibiting neoplastic cell invasion; and (f) preventing or inhibiting neoplasticcell 

5 metastasis. In a preferred embodiment, the neoplastic are cancerous or from a tumor. In another 
aspect of the invention, said neoplastic cellsis selected from the group consisting of bladder 
carcinoma, hepatocellular carcinoma, hepatoblastoma, rhabdomyosarcoma, ovarian carcinoma, 
cervical carcinoma, lung carcinoma, breast carcinoma, squamous cell carcinoma in head and neck, 
esophageal carcinoma, thyroid carcinoma, astrocytoma, ganglioblastoma, neuroblastoma, 

10 lymphoma, myeloma, sarcoma and neuroepithelioma. In yet another aspect of the invention, said 
neoplastic cells are malignant or benign. Further included in the invention are the following protein 
sequences: 

MLDSLLALGGLVLLRDSVEWEGRSLLKALVKKSALCGEQVHILGCEVSEEEFREGFDSDI 
NNRLVYHDFFRDPLNW 
15 TLCQVLHAVSHQDSCPGDSSSVGKVSVLGLLHEELHGPGPVGALSSLAQTEVTLGGTMGQ 
ASAHILCRRPRQRPTDQTQWFSILPDFSLDLQEGPSV^ 

KKEREARDSLILPFQFSSEK(^ALLRPPJ > GQATSHIFYEPDAYYDLDQEDPDDDLDI, 
IvnLDSLLAIGGLVLLRDSVEWEGRSLLKALIKKSALRGEQVHVLGCEVSEEEFREGFDSDV 
NSRLVYHDLFRDPLNWSKPGEAWEGPLKALRSMCKRTDHGSVTIALDSLSWLLCHIPCV 
20 TLCQALHALSQQNGDPGDNSLVEQVHVLGLLHEEUIGPGSMGALNTLAHTEVTLSGKV^ 
QTSASILCRRPQQRATYQTWWSVLPDFS 
HLSKKEREARDSLTLPFQFSSEKQKALLHPWS 

SLIJCALIKKSALRGEQVHVLGCEVSEEEFREGFDSDVNSRLVYHDLFRDPLNWS 
PEGPLKAI^MCKRTDHGSVTIALDSLSWLLCHIPCVTLCQALHALSQQNGDPGDNSLVE 
25 QWVLGLLHEELHGPGSMGAimLAmEVTLSGKVDQTSASILCRPJ'QQRATYQTWWS 
VLPDFSLTLHEGLPLRSELHPDHHTTQVDCT 
KALLHPVPSRTTGHIFYEPDAFDDVDPEDPDDDLDI, 
MLDSLLAIGGLVLLRDSVEWEGR5LLKALIKKSALRGEQVHVLG 

NSRLVYHDLFRDPLNWSKPGEAWEGPLKALRSMCKRTDHGSVTIALDSLSWLLCHIPCV 
30 TLCQALHALSQQNGDPGDNSLVEQVHVLGLLHEELHGPGSMGALNTLAHTEVTLSGKVD 
QTSASILCRPJ > QQRATYQTWWSVLPDFSLTLHEGLPLRSELHPDHHTTQVDPTAHL 
HLSKKEREARDSLTLPFQFSSFJCQKALLHPWSRTTGRH^PDAFDDVDQEDPDDDLDI, 
and 

MGTPGEGLGRCSHALIRGVPESLASGEGAGAGLPALDLAKAQREHGVLGGKLRQRLGLQ 
35 LLELPPEESLPLGPLLGDTAVIQGDTALrr^ 

NATLDGTDVLFTGREFFVGLSKWTNHRGAEIVADTFPJ^FAVSTWVSGSSHLRGLCGMGG 
PRTWAGSSEAAQKAVRAMAALTDHPYASLTLPDDAASDCLFI.PJ'GLPGATPFLIJIRGGS 
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AEAL. 

These embodiments also comprise the death effector domain of RET-A-MODULIN, and 
other death effector domains including peptides 

LVKKSALCGEQVHIL, LVKRHRLATMPPMV, LGWLCLLLLPIPLI, LHSDSGISVDSQSL, 
5 LPAGDRLTGIPSHI, LLLPLVLRALLVDV, LQPGPQLYDVMDAV, LDCVRLLLQYDAEI, 
LDCVRLLLQYNAEI, LLEQNDLEPGHTEL, LLEQNDLERGHTGL, MDGPRLLLLLLLGV 
MDRLRLLLLLILGV, LKPENILVDNDFHI, LKPENBLVDRDFHI, LLLPLVLLELLVGI, 
LLLSLVLLALLMGI, LLLSLVLLALLMGI, LWALLILLIPIVLI, LWLLTILVLLIPLV, 
LLPLPVRAQLCAHL, WTELARELDFTEEQIH, WRRLARQLKVSDTKID, 

10 WKRLARELKVSEAKMD, WHQLHGKKEAYDTLIK, WRQLAGELGYKEDLID, 
WEPMVLSLGLSQTDIY, WAELARELQFSVEDIN, WAELARELQFSVEDIN, 
WRHLAGELGYQPEHED, WRHLAGELGYQPEHID, WKNCARKLGFTQSQID, 
WKNCARKLGFTESQID, WKEFVRRLGLSDHEED, WKEFMRFMGLSEHEIE, 
WKEFVRRLGLSEHEIE, WKEFMRLLGLSEHEBE, WKEFVRTLGLREAEEE, 

15 VKEFVRKNGMEEAKID, CWYQSHGKSDAYQDL1K, WQQLATAVKLYPDQVE. 

A preferred embodiments of the invention comprise physiologically acceptable 
compositions and methods of treating cancer in a patient (such as prostate cancer, skin 
cancer/melanoma, pancreatic carcinoma, colon cancer, melanoma, ovarian cancer, liver cancer, 
small cell lung carcinoma, non-small cell lung carcinoma, cervical cancer, breast cancer, bladder 

20 cancer, brain cancer, neuroblastoma/glioblastoma, leukemia, lymphoma, head and neck cancer, 
kidney cancer, myeloma and ovarian cancer) characterized by proliferation of neoplastic cells 
which comprises administering to the patient an amount of a polypeptide of the invention, effective 
to: (a) selectively induce apoptosis and/or necrosis in such neoplastic cells and thereby inhibit 
their proliferation; (b) inhibit cell growth and proliferation of the neoplastic cells; (c) inhibit 

25 invasion of the neoplastic cells; (d) inhibit metastasis of the neoplastic cells; (e) kill neoplastic 
cells; (f) preferentially inhibit cell growth and proliferation of the neoplastic cells; and 
(g) preferentially kill neoplastic cells. RET-A-MODULIN or other proteins of the invention or 
fragments thereof can be used in combination with one or more of various anticancer agents known 
as cancer chemotherapeutic agents and/or radiation therapy. The active ingredient compound of 

30 the invention which can produce an excellent anticancer effect can thus markedly promote the 
effect of the other anticancer agent or agents used in combination, to produce a synergistic effect. 
Therefore, even when the partner anticancer agent or agents are used in doses much smaller than 
the usual doses, a satisfactory anticancer effect can be obtained, whereby the adverse effects of the 
partner anticancer agent or agents can be minimized. As such chemotherapeutic agents included 

35 but not limited to, for example, 5-fluorouracil (5-FU; Kyowa Hakko Kogyo), mitomycin C 

(Kyowa Hakko Kogyo), futraful (FT-207; Taiho Pharmaceutical), endoxan (Shionogi & Co.) and 
toyomycin (Takeda Chemical Industries), hi addition, the apoptosis regulating composition of the 
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present invention may be administered with a vitamin D derivative to further enhance its cytotoxic 
characteristics (United States Patent 6,087,350). The anti-cancer agents of the present invention 
may be combined with an anti-oestrogen compound such as tamoxifen or anti-progesterone such as 
onapristone (see, EP 616812) in dosages known for such molecules. 

5 The pharmaceutically and physiologically acceptable compositions utilized in this 

invention may be administered by any number of routes including, but not limited to, parenteral, 
subcutaneous, intracranial, intraorbital, intracapsular, intraspinal, intracisternal, intrapulmonary 
(inhaled), oral, intravenous, intramuscular, intra-arterial, intramedullary, intrathecal, 
intraventricular, transdermal, subcutaneous, intraperitoneal, intranasal, enteral, topical, sublingual, 

10 or rectal means. In addition to the active ingredients, these pharmaceutically and physiologically 
acceptable compositions may contain suitable physiologically acceptable carriers comprising 
excipients and auxiliaries, which facilitate processing of the active compounds into preparations, 
which can be used pharmaceutically. Further details on techniques for formulation and 
administration may be found in the latest edition of Remington's Pharmaceutical Sciences (Maack 

15 PublishingCo. Easton, Pa). Pharmaceutically and physiologically acceptable compositions for oral 
administration can be formulated using physiologically acceptable carriers well known in the art in 
dosages suitable for oral administration. Such carriers enable the pharmaceutically and 
physiologically acceptable compositions to be formulated as tablets, pills, dragees, capsules, 
liquids, gels, syrups, slurries, suspensions, and the like, for ingestion by the patient. Pharmaceutical 

20 preparations for oral use can be obtained through a combination of active compounds with solid 
excipient, suiting mixture is optionally grinding, and processing the mixture of granules, after 
adding suitable auxiliaries, if desired, to obtain tablets or dragee cores. Suitable excipients are 
carbohydrate or protein fillers, such as sugars, including lactose, sucrose, mannitol, or sorbitol; 
starch from corn, wheat, rice, potato, or other plants; cellulose, such as methyl cellulose, 

25 hydroxypropylmethyl-cellulose, or sodium carboxymethylcellulose; gums including arabic and 
tragacanth; and proteins such as gelatin and collagen. If desired, disintegrating or solubilizing 
agents may be added, such as the cross-linked polyvinyl pyrrolidone, agar, alginic acid, or a salt 
thereof, such as sodium alginate. 

Dragee cores may be used in conjunction with suitable coatings, such as concentrated 

30 sugar solutions, which may also contain gum arabic, talc, polyvinylpyrrolidone, carbopol gel, 
polyethylene glycol, and/or titaniumdioxide, lacquer solutions, and suitable organic solvents or 
solvent mixtures. Dyestuffs or pigments may be added to the tablets or dragee coatings for product 
identification or to characterize the quantity of active compound, i.e., dosage. 

Pharmaceutical preparations, which can be used orally, include push-fit capsules made of 

35 gelatin, as well as soft, sealed capsules made of gelatin and a coating, such as glycerol or sorbitol. 
Push-fit capsules can contain active ingredients mixed with filler or binders, such as lactose or 
starches, lubricants, such as talc or magnesium stearate, and, optionally, stabilizers. In soft 
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capsules, the active compounds may be dissolved or suspended in suitable liquids, such as fatty 
oils, liquid, or liquid polyethylene glycol with or without stabilizers. Pharmaceutical formulations 
suitable for parenteral administration may be formulated in aqueous solutions, preferably in 
physiologically compatible buffers such as Hanks solution, Ringer ! s solution, or physiologically 
5 buffered saline. Aqueous injection suspensions may contain substances, which increase the 
viscosity of the suspension, such as sodium carboxymethylcellulose, sorbitol, or dextran. 
Additionally, suspensions of the active compounds may be prepared as appropriate oily injection 
suspensions. Suitable lipophilic solvents or vehicles include fatty oils such as sesame oil, or 
synthetic fatty acid esters, such as ethyl oleate or triglycerides, or liposomes. Optionally, the 

10 suspension may also contain suitable stabilizers or agents, which increase the solubility of the 
compounds to allow for the preparation of highly, concentrated solutions. For topical or nasal 
administration, penetrants appropriate to the particular barrier to be permeated are used in the 
formulation. Such penetrants are generally known in the art. The pharmaceutically and 
physiologically acceptable compositions of the present invention may be manufactured in a 

15 manner that is known in the art, e.g., by means of conventional mixing, dissolving, granulating, , 
dragee-making, levigating, emulsifying, encapsulating, entrapping, or lyophilizing processes. 

The pharmaceutical composition may be provided as a salt and can be formed with many 
acids, including but not limited to, hydrochloric, sulfuric, acetic, lactic, tartaric, malic, succinic, 
etc. Salts tend to be more soluble in aqueous or other protonic solvents than are the corresponding 

20 free base forms. In other cases, the preferred preparation may be a lyophilized powder which may 
contain any or all of the following: 1-50 mM histidine, 0.1%-2% sucrose, and 2-7% mannitol, at a 
pH range of 4.5 to 5.5, that is combined with buffer prior to use. After pharmaceutically and 
physiologically acceptable compositions have been prepared, they can be placed in an appropriate 
container and labeled for treatment of an indicated condition. For administration of RET-A- 

25 MODULIN, such labeling would include amount, frequency, and method of administration. 
Pharmaceutically and physiologically acceptable compositions suitable for use in the invention 
include compositions wherein the active ingredients are contained in an effective amount to 
achieve the intended purpose. The determination of an effective dose is well within the capability 
of those skilled in the art. For any compound, the therapeutically effective dose can be estimated 

30 initially either in cell culture assays, e.g., of neoplastic cells, or in animal models, usually mice, 
rabbits, dogs, or pigs. The animal model may also be used to determine the appropriate 
concentration range and route of administration. Such information can then be used to determine 
useful doses and routes for administration in humans. Those of ordinary skill in the art are well 
able to extrapolate from one model (be it an in vitro or an in vivo model). A therapeutically 

35 effective dose refers to that amount of active ingredient, for example RET-A-MODULIN 
polypeptides or other proteins of the invention or fragments thereof, which ameliorates the 
symptoms or condition. Therapeutic efficacy and toxicity may be determined by standard 
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pharmaceutical procedures in cell cultures or experimental animals, e.g., ED50 (the dose 
therapeutically effective in 50% of the population) and LD50 (the dose lethal to 50% of the 
population). The dose ratio between therapeutic and toxic effects is the therapeutic index, and it 
can be expressed as the ratio, LD50/ED50. Pharmaceutically and physiologically acceptable 
5 compositions, which exhibit large therapeutic indices, are preferred. The data obtained from cell 
culture assays and animal studies is used in formulating a range of dosage for human use. The 
dosage contained in such compositions is preferably within a range of circulating concentrations 
that include the ED50 with little or no toxicity. The dosage varies within this range depending 
upon the dosage form employed, sensitivity of the patient, and the route of administration. The 

10 practitioner, in light of factors related to the subject that requires treatment, will determine the 
exact dosage. Dosage and administration are adjusted to provide sufficient levels of the active 
moiety or to maintain the desired effect. Factors, which may be taken into account, include the 
severity of the disease state, general health of the subject, age, weight, and gender of the subject, 
diet, time and frequency of administration, drug combination(s), reaction sensitivities, and 

1 5 tolerance/response to therapy. Long-acting pharmaceutically and physiologically acceptable 
compositions maybe administered every 3 to 4 days, every week, or once every two weeks 
depending on half-life and clearance rate of the particular formulation. Normal dosage amounts 
may vary from 0.1 to 100,000 micrograms, up to a total dose of about 1 g, depending upon the 
route of administration. Guidance as to particular dosages and methods of delivery is provided in 

20 the literature and generally available to practitioners in the art. Those skilled in the art will employ 
different formulations for nucleotides than for proteins or their inhibitors. Similarly, delivery of 
polynucleotides or polypeptides will be specific to particular cells, conditions, locations, etc. For 
the prevention or treatment of disease, the appropriate dosage of an anti-tumor agent herein will 
depend on the type of disease to be treated, as defined above, the severity and course of the 

25 disease, whether the agent is administered for preventive or therapeutic puiposes, previous therapy, 
the patient's clinical history and response to the agent, and the discretion of the attending 
physician. The agent is suitably administered to the patient at one time or over a series of 
treatments. Animal experiments provide reliable guidance for the determination of effective doses 
for human therapy. Interspecies scaling of effective doses can be performed following the 

30 principles laid down by Mordenti, J. and Chappell, W. "The use of interspecies scaling in 
toxicokinetics" in Toxicokinetics and New Drug Development, Yacobi et al , eds., Pergamon 
Press, New York 1989, pp. 42-96. For example, depending on the type and severity of the disease, 
about 1 g/kg to 15 mg/kg (e.g., 0.1-20 mg/kg) of an antitumor agent is an initial candidate dosage 
for adrninistration to the patient, whether, for example, by one or more separate administrations, or 

35 by continuous infusion. A typical daily dosage might range from about 1 g/kg to 100 g/kg or 
more, depending on the factors mentioned above. For repeated administrations over several days 
or longer, depending on the condition, the treatment is sustained until a desired suppression of 
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disease symptoms occurs. However, other dosage regimens may be useful. The progress of this 
therapy is easily monitored by conventional techniques and assays. Guidance as to particular 
dosages and methods of delivery is provided in the literature; see, for example, U.S. Pat. Nos. 
4,657,760; 5,206,344; or 5,225,212. It is anticipated that different formulations will be effective 
5 for different treatment compounds and different disorders, that administration targeting one organ 
or tissue, for example, may necessitate delivery in a manner different from that to another organ or 
tissue. Therapies may be designed to utilize RET-A-MODULIN cytotoxic properties. In 
particular, therapies to enhance RET-A-MODULIN expression or administration of said 
polypeptides are useful in promoting inhibition or death of cancerous cells. Cytotoxic reagents 

10 may include, without limitation, full length or fragment RET-A-MODULIN polypeptides, mRNA, 
or any compound, which increases RET-A-MODULIN biological activity. 

Another therapeutic approach within the invention involves administration of RET-A- 
MODULIN therapeutic compositions (polynucleodtide, antibody, small molecule agonist or 
recombinant RET-A-MODULIN polypeptide), either directly to the site of a desired target cell or 

1 5 tissue (for example, by injection) or to a site where the composition will be further directed to the 
target cell or tissue, or systemically (for example, by any conventional recombinant protein 
administration technique). The dosage of RET-A-MODULIN depends on a number of factors, 
including the size and health of the individual patient, but, generally, between 0.1 mg and 100 mg 
inclusive is administered per day to an adult in any physiologically acceptable formulation. 

20 In another embodiment, RET-A-MODULIN polypeptides and nucleic acid sequences find 

diagnostic use in the detection or monitoring of conditions involving aberrant levels of apoptosis. 
For example, decreased expression of RET-A-MODULIN may be correlated with decreased 
apoptosis in humans. Accordingly, a decrease or increase in the level of RET-A-MODULIN 
production may provide an indication of a deleterious condition. Levels of RET-A-MODULIN 

25 expression may be assayed by any standard technique such as Northern blot analysis and RT-PCR 
in biopsy specimen. 

These embodiments comprise methods for detection of RET-A-MODULIN-mediated 
proliferation inhibition and apoptosis including in vitro activity tests of RET-A-MODULIN or 
other proteins of the invention or fragments thereof, further cellular proliferation assays, and 

30 cellular apoptosis/necrosis assays. Specific examples of apoptosis assays are also provided in the 
following references. Assays for apoptosis in lymphocytes are disclosed by Noteborn et al., US 
Patent 5,981,502, 1999, Li et al, "Induction of apoptosis in uninfected lymphocytes by HIV-1 Tat 
protein", Science 268: 429-431, 1995; Gibellini et al, "Tat-expressing Jurkat cells show an 
increased resistance to different apoptotic stimuli, including acute human immunodeficiency virus- 

35 type 1 (HIV-1) infection", Br. J. Haematol. 89: 24-33, 1995; Martin et aL % "HIV-1 infection of 
human CD4.sup.+ T cells in vitro. Differential induction of apoptosis in these cells." J. Immunol. 
152:330-342, 1994; Terai et al, "Apoptosis as a mechanism of cell death in cultured T 



178 



WO 02/094864 



PCT/IB01/01715 



lymphoblasts acutely infected with HIV-l", L Clin Invest. 87: 1710-1715, 199 1; Dhein et aL, 
"Autocrine T-cell suicide mediated by APO-l/(Fas/CD95)", Nature 373: 438-441, 1995; Katsikis 
et aL, "Fas antigen stimulation induces marked apoptosis of T lymphocytes in human 
immunodeficiency virus-infected individuals", J. Exp. Med 1 815:2029-2036, 1995; Westendorp et • 
5 aL, "Sensitization of T cells to CD95-mediated apoptosis by HIV-l Tat and gpl20", Nature 
375:497, 1995; DeRossi et aL, Virology 198:234-244, 1994. Assays for apoptosis in fibroblasts 
are disclosed by: Vossbeck et aL, "Direct transforming activity of TGF-beta on rat fibroblasts", Int. 
J. Cancer 61:92-97, 1995; Goruppi et aL, "Dissection of c-myc domains involved in S phase 
induction of NIH3T3 fibroblasts", Oncogene 9:1537-44, 1994; Fernandez et aL, "Differential 

10 sensitivity of normal and Ha-ras transformed C3H mouse embryo fibroblasts to tumor necrosis 
factor: induction of bcl-2, c-myc, and manganese superoxide dismutase in resistant cells", 
Oncogene 9:2009-2017, 1994; Harrington et aL, "c-Myc-induced apoptosis in fibroblasts is 
inhibited by specific cytokines", EMBO J. 13:3286-3295, 1994; Itoh et aL, "A novel protein 
domain required for apoptosis. Mutational analysis of human Fas antigen", J. Biol. Chem. 

15 268: 10932-10937, 1993. In vitro cellular proliferation assays comprise cultured cells such as 
Jurkat, HepG2, K562, or HeLa, which are treated with RET-A-MODULIN or fragments thereof at 
concentration ranges for example from 0.5 to 25 ug/mL, and percent decrease in cellular 
proliferation is measured 24, 48, and 72 hours after treatment Cellular apoptosis is measured 
using an apoptosis assay kit such as VYBEANT™ Apoptosis Assay Kit #3 (Molecular Probes). 

20 After harvesting and washing, cells are stained with a FITC-labeled anti-RET-A-MODULIN 
antibody and analyzed by FACS according to manufacturer's instructions. Cells will be stained 
with PI or DAP1 to detect apoptotic nuclei. DNA fragmentation analysis will be performed by 
cellular DNA extraction and Southern blot analysis using about 1 ug of DNA and hybridized with 
randomly primed 32 P-labeled chromosomal DNA from said cells, which had not been treated, with 

25 RET-A-MODULIN. 

These embodiments also comprise the production of RET-A-MODULIN or other proteins 
of the invention or fragments thereof by subcloning of said nucleotides into an expression vector 
such as pCMV-neo for transfection assays, Western blot analysis to measure protein expression, 
and detection of RET-A-MODULIN-induced apoptosis by indirect immunofluorescence and DNA 

30 fragmentation analysis. Also included in the invention is the generation of specific antibodies 
against RET-A-MODULIN or other proteins of the invention or fragments thereof according to 
methods described in the art, wherein said antibodies can be polyclonal or monoclonal. 

RET-A-MODULIN also shares homologies with two phosphorylated matrixproteins with 
the human cytomegalovirus, a pathogenic herpesvirus causing complications in patients with 

35 suppressed cellular immune functions and in prenatal infections (Ruger et aL, J Virology 61 :446- 
453, 1987, Koretz et aL, N.Engl.LMed.314:801-805, 1986, Bowden et aL, 
N.EnglJ.Med.314:1006-1010, 1986). A preferred embodiment comprises the use of RET-A- 
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MODULIN and fragments thereof including 

GPGPVGALSSLAQTEVTLG, EGPSVESQPYSD, EVSEEEFREGFDSDINN, 
TTLCQVLHAVSHQDSCPGDSSSVGKVSVLGLLHEELHGPGPVGALS, GPSVESQPYSD, 
CQVLHAVSH, GKVSVLGLLHEELHGPGPV 
5 for vaccination against Herpesvirus infections, as well as a vaccine preparation against 
Herpesviruses such as human cytomegalovirus (HCMV) and Kaposi Sarcoma-Associated 
Herpesvirus/Human Herpesvirus 8, which preparation comprises a RET-A-MODULIN protein or 
protein part according to the invention and optionally one or more carriers and adjuvants suitable 
for subunit vaccines. The use of a RET-A-MODULIN protein or protein part as defined above in a 

10 process for producing RET-A-MODULIN-specific polyclonal or monoclonal antibodies also falls 
within the scope of the invention. Vaccination and immunization generally refer to the 
introduction of a non-virulent agent against which an individual's immune system can initiate an 
immune response, which will then be available to defend against challenge by a pathogen. The 
immune system identifies invading "foreign" compositions and agents primarily by identifying 

1 5 proteins and other large molecules that are not normally present in the individual. The foreign 
protein represents a target against which the immune response is made. A further example is a use 
of RET-A-MODULIN-specific antibodies according to the invention for passive immunization 
against Herpesvirus infections, as well as an immunization preparation for passive immunization 
against Herpesvirus infections, which preparation includes RET-A-MODULIN-specific antibodies 

20 according to the invention and optionally one or more carriers and adjuvants suitable for passive 
immunization preparations. 

As regards preparative applications, one example is the use of RET-A-MODULIN-specific 
antibodies according to the invention in a process for isolating and/or purifying RET-A- 
MODULIN. Routes of administration include, but are not limited to, intramuscular, 

25 intraperitoneal, intradermal, subcutaneous, intravenous, intraarterially, intraocularly and oral as 
well as transdermally or by inhalation or suppository. Preferred routes of administration include 
intramuscular, intraperitoneal, intradermal and subcutaneous injection as described by Pachuk et 
al., US Patent 6,235,888 (2001); see also Noteborn et al., US Patent 6,238,669 (2001), Patel et al., 
Diagnostic Molecular Pathology 10:95-99 (2001), and Aoki and Tosato, Leuk Lymphoma, 41:229- 

30 237 (2001), which references are hereby incorporated in their entirety. 

Proteins of SEQ ID NO:48 (Internal designation Clone 1000855165_205-99-l-0-A5-F) and 
SEQ ID NO:52 (Internal designation Clone 500721700_204-43-4-0-H10-F) 

The cDNA of clone 1000855 1 65^205-99-1 -0-A5-F (SEQ ID:47) encodes the protein of 
SEQ ID NO:48 comprising the amino acid sequence: 

35 MIYTMBCKVHALWASVCXLLNLAPAPLNADSEE 
GPCKAIMKRITTMFTRQC 
GYTTRYFY1WQTKQCERFKYG 
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NNSLTPQSTKWSLFEFHGPSWCLTPADRGLCRANENRFYYNSV 

NNFTSKQECLRACKKGFIQRISKGGL Accordingly, it 

will be appreciated that all characteristics and uses of polypeptides of SEQ ID NO:48 described 
throughout the present application also pertain to the polypeptides encoded by the nucleic acids 

5 included in Clone 1 000855 165_205-99-l-0-A5-F. In addition, it will be appreciated that all 
characteristics and uses of the polynucleotides of SEQ ID NO:47 described throughout the present 
application also pertain to the nucleic acids included in Clone 1000855 165_205-99-l-0-A5-F. A 
preferred embodiment of the invention is directed toward the compositions of SEQ ID NO:47, 
SEQ ID NO:48, and Clone 1 000855 165_205-99-l-0-A5-F. Also preferred are polypeptide 

10 fragments having a biological activity as described herein and the polynucleotides encoding the 
fragments. 

The cDNA of clone 500721700_204-43-4-0-H10-F (SEQ ID:51) encodes the protein of 
SEQ ID NO: 52 comprising the amino acid sequence: 
MTYTMKKVHALW 
15 GPCKAIMKRFFFNIFTRQCEEFrc 
GYTTRYFYNNQTKQCERFKYG^ 
1WSLTPQSTKWSLFEFHGPSW 

NNFTSKQECLRACKKGFIQRE^ Accordingly, it 

will be appreciated that all characteristics and uses of polypeptides of SEQ ID NO:52 described 

20 throughout the present application also pertain to the polypeptides encoded by the nucleic acids 
included in Clone 500721700_204-43-4-0-H10-F. In addition, it will be appreciated that all 
characteristics and uses of the polynucleotides of SEQ ID NO:5 1 described throughout the present 
application also pertain to the nucleic acids included in Clone 500721700_20443-4-0-H10-F. A 
preferred embodiment of the invention is directed toward the compositions of SEQ ID NO:5 1 , 

25 SEQ ID NO:52, and Clone 500721700 J204-43^M)-H10-F. Also preferred are polypeptide 
fragments having a biological activity as described herein and the polynucleotides encoding the 
fragments. 

The protein of SEQ ID NO:48 encodes Tifapinix. The protein of SEQ ID NO:52 encodes 
Tifapinix-A58S. Tifapinix-A58S differs from Tifapinix in having serine at position 58 rather than 

30 alanine (A58S) (numbered from the initiating methionine of Tifapinix). It will be appreciated that 
the specification, composition, and embodiments directed herein to Tifapinix also are given to be 
directed as well to Tifapinix-A58S. Furthermore, it will also be appreciated that in said 
specification, composition, and embodiments directed to any polypeptide of Tifapinix wherein said 
polypeptide includes alanine at position 58, that said specification, composition, and embodiments 

35 given to be directed as well to the corresponding polypeptide of Tifapinix include amino acid 
serine at position 58. 

Tifapinix is a novel splice variant of tissue factor pathway inhibitor (TFPI-1). Tissue 
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factor (TF) initiates the extrinsic coagulation pathway (US Patent 5,849,875; US Patent 5,106,833; 
US Patent 6,103,499; US Patent 5,773,251; US Patent 5,994,125, 1999, which disclosures are 
hereby incorporated by reference in their entirety). TFPI-1 is also known as lipoprotein associated 
coagulation inhibitor (LACI), so named because of its affinity for plasma lipoprotein. 
5 Tifapinix has novel function as described below. 

TFPI-1 is a secreted trivalent Kunitz-type plasma proteinase inhibitor that negatively 
regulates the initiation of coagulation by producing activated factor X (FXa) feedback inhibition of 
the catalytic complex of activated factor VII (FVUa) and TF. The second Kunitz domain of TFPI- 
1 binds and inhibits FXa, whereas the first Kunitz domain is responsible for the inhibition of FVUa 

10 in the TF-FVUa complex. The linker region between Kunitz domains 1 and 2 of TFPI-1 is 
comprised of 20 amino acids (US Patent 5,849,875 which disclosures is hereby incorporated by 
reference in its entirety): TRDNANREKTTLQQEKPDF. The function of the third Kunitz 
domain is unknown, although there is evidence that it contains a heparin binding site. Heparin 
binding site(s) have also been mapped carboxyl-terminal to the third Kunitz domain. 

15 TFPI-1 directly inhibits FXa and, in a FXa-dependent fashion, produces feedback 

inhibition of the TF-FVIIa catalytic complex. TFPI-1 is the major inhibitor of the protease activity 
of the TF-FVUa complex. The allosteric promotion of TF-FVUa binding by Kunitz domain 1 on 
FXa binding to Kunitz domain 2 presumably is carried out at least in part through the linker region 
between Kunitz domains 1 and 2. The finding that the Kunitz domain 2, which binds FXa, is 

20 required for inhibition of the TF-VIIa complex has led to the proposal that TFPI-1 inhibits TF- 
FVIIa by forming a quaternary TF-FVHa-FXa-TFPI-l complex. The formation of a quaternary 
complex can result from either the initial binding of TFPI-1 to FXa, with subsequent binding to the 
TF-VIIa complex or, alternatively, TFPI-1 could bind directly to a preformed TF-FVHa-FXa 
comples. The consequence of the formation of the quaternary complex is that TF can no longer 

25 participate in initiating coagulation. 

Aside from it role in coagulation, FXa plays a role in inflammation. FXa generated by TF- 
FVIIa has been shown to lead to pro-inflammatory activation of vascular endothelial cells through 
its cleavage of protease-activated receptor 2 (PAR2) (Camerer, E et al., Proc. Natl. Acad. Sci. USA 
97:5255-60 (2000) which disclosure is hereby incorporated by reference in its entirety). FXa can 

30 also elicit a pro-inflammatory cellular response by cleavage of protease-activated receptor 1 
(PARI) (Kravchenko, RM Blood 97:3 109-16 (2001) which disclosure is hereby incorporated by 
reference in its entirety). HLA-DR-restricted macrophage expression of TF in rheumatoid 
synovium is believed to play a role in disease pathogenesis in part through generation of FXa 
(Dialynas DP et al., Arthritis and Rheumatism 41:1515-6 (1998) which disclosure is hereby 

35 incorporated by reference in its entirety). 

TF is a bifunctional molecule capable of inducing both fibrin deposition and angiogenesis 
in cancer. Cancer patients are prone to venous thromboembolism, and this hypercoagulability 
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favors tumor growth and metastasis. In human lung cancer, melanoma, and breast cancer, TF and 
vascular endothelial growth factor (VEGF) co-localize in tumor cells; a close correlation exists 
between TF and VEGF synthesis in tumor cell lines and with angiogenesis in vivo in a severe, 
combined irmnunodeficient mouse model (Rickles, FR et al., Int. J. Hematol. 73:145-50 (2001); 
5 Wojtukiewicz MZ et al., Thromb. Haemost. 82: 1659-62 (1999); Abdulkadir SA, et al., Hum. 
Pathol. 31:443-7 (2000); Koomagi R et al., Int. J. Cancer 79:19-22 (1998) which disclosures are 
hereby incorporated by reference in their entirety). 

TF supports metastasis (Mueller BM et al., J. Clin. Invest. 101:1372-8 (1998); Fischer EG 
et al., J. Clin. Invest. 104:1213-21 (1999) which disclosures are hereby incorporated by reference 

10 in their entirety). Equally important for this process are (a) interactions of the TF cytoplasmic 
domain, which binds the mobility-enhancing actin-binding protein 280, and (b) formation of a 
proteolytically active TF-FVHa complex on the tumor cell surface. In primary bladder carcinoma 
cells, this complex localizes to the invasive edge, in proximity to tumor-infiltrating vessels that 
stain intensely for TFPI-1. Tumor cell adhesion and migration was shown in vitro to be supported 

15 by interaction of TF-FVHa with TFPI-1 immobilized heparin. 

TF antigen has been detected in all cellular elements comprising the atheriosclerotic 
plaque. The most abundant sources of TF appear to be the macrophages and intimal smooth 
muscle cells located in the cap surrounding the lipid-rich necrotic core. TF antigen is also present 
in the medial and endothelial cells overlying the plaque. In addition to its association with vascular 

20 cells, TF antigen is also found in the extracellular matrix of the intima and in the necrotic core. 
This TF may come in contact with circulating blood when the plaque ruptures— the most important 
precipitant of acute arterial thrombosis (Taubman MB et al., Thrombosis and Haemostasis 82:801- 
5 (1999) which disclosure is hereby incorporated by reference in its entirety). 

Recently it has been shown that TFPI-1 inhibits the proliferation of basic fibroblast growth 

25 factor-stimulated endothelial cells. A truncated form of TFPI- 1 , containing only the first two 
Kunitz-type proteinase inhibitor domains, has very little antiproliferative activity, suggesting that 
the carboxyl-terminal region of TFPI-1 is responsible for this activity (Hembrough, TA et al., J. 
Biol. Chem. 276:12241-8 (2001) which disclosure is hereby incorporated by reference in its 
entirely). By virtue of this activity, TFPI-1 is an inhibitor of angiogenesis. Anomalous 

30 angiogeneisis plays an important role in a number of pathologies, including cancer, proliferative 
diabetic retinopathy, and rheumatoid arthritis (Folkman, J, Forum (Geneva) 9(3 Suppl 3):59-62 
(1999); Danis, RP et al., Expert Opin. Pharmacother 2:395-407 (2001); Stupack, DG et al., Braz J. 
Med. Biol. Res. 32:573-81 (1999) which disclosures are hereby incorporated by reference in their 
entirety). 

35 In the case of Tifapinix, alternative splicing results in the internal deletion of exon 5 

comprised of 13 amino acids from the linker region between Kunitz domains 1 and 2 (Girard, TJ et 
al., J. Biol. Chem. 266:5036-41 (1991) which disclosure is hereby incorporated by reference in its 
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entirety). The A58S amino acid substitution that distinguishes Tifapinix-A58S from Tifapinix, as 
well as from the canonical TFPI-1 amino acid sequence (NCBI Accession No. P10646 which 
disclosure is hereby incorporated by reference in its entirety), establishes that the alternative 
splicing of TFPI-1 represented by Tifapinix can occur for more than one allele of TFPI-1, thereby 
5 supporting the thesis that the alternative splicing represented by Tifapinix plays a significant and 
unique role in TFPI-1 biology. 

The resultant shortened linker region between Kunitz domains 1 and 2 is comprised of 7 
amino acids: TREKPDF. The deletion also results in the generation of a novel amino acid 
neighborhood around the two amino acids bracketing the deletion (RE, underlined above). 

10 Tifapinix retains the capacity to bind to FXa (Kunitz domain 2), but has lost the capacity to 

allosterically promote binding of Kunitz domain 1 to TF-FVIIa in response to the FXa binding. As 
Tifapinix retains the capacity to inhibit FXa, Tifapinix therefore remains both an anti-coagulant 
and an anti-inflammatory. As the carboxyl terminus of Tifapinix remains intact, Tifapinix retains 
the capacity to inhibit angiogenesis. 

15 Importantly however and in contradistinction to TFPI-1, by virtue of having lost the 

capacity to allosterically promote TF-FVIIa-binding by Kunitz domain 1, Tifapinix has lost the 
capacity to be recruited by TF-FVIIa for promotion of tumor cell metastasis (Mueller BM et al., J. 
Clin. Invest. 101:1372-8 (1998); Fischer EG et al., J. Clin. Invest. 104:1213-21 (1999) which 
disclosures are hereby incorporated by reference in their entirety). 

20 In a preferred embodiment, the present invention provides for an antibody that specifically 

binds Tifapinix of the present invention. Further preferred is a method for making said antibody 
wherein said antibody recognizes a non-conformational or conformational epitope of Tifapinix. 

Further preferred is a method for making said antibody wherein a mouse is immunized 
with Tifapinix. Further preferred is a method wherein monoclonal antibodies derived from said 

25 mouse are screened for binding to Tifapinix but not to TFPI-1 . Further preferred is a method of 
making said antibody wherein said antibody is directed to the novel linker region sequence of 
Tifapinix comprised of amino acids 105-1 11, numbered from the initiating methionine of 
Tifapinix, or any fragment thereof. Further preferred is a method wherein monoclonal antibodies 
derived from said mouse are screened by sandwich enzyme-linked immunosorbent assay (ELISA) 

30 for binding to Tifapinix but not to TFPI-1 . Methods of generating said monoclonal antibody and 
of estabhshing its specificity by methods including sandwich ELISA are well known to those 
skilled in the art. 

In a preferred embodiment, the present invention provides for a method of contacting said 
antibody and specifically binding it with Tifapinix. Further preferred is a method for using said 
35 antibody diagnostically to determine the basis either for immune dysfunction or for 

inflarnmopathology. In the case of inflammopathology, of which the disease states below are 
representative, the level of Tifapinix expression is expected to be depressed. In the case of non- 
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inflammatory immune dysregulation, Tifapinix status is more difficult to predict a priori. In either 
case, Tifapinix status is expected to facilitate diagnosis and, moreover, facilitate stratification of 
disease states. Furthermore, Tifapinix status may also have prognostic value. Further preferred is 
a method of using said antibody diagnostically in a sandwich ELISA format to quantitate Tifapinix 
5 in plasma or other bodily fluid, including but not restricted to synovial fluid and cerebrospinal 
fluid, within a pathological context. Further preferred is a method of using said diagnostic assay to 
determine the level of Tifapinix in plasma or other bodily fluid of a patient with either 
dysregulated immune function or inflammopathology wherein the immune dysfunction or 
inflammopathology is selected from, but not restricted to, the group consisting of: (a) Rheumatoid 

10 arthritis; (b) Atheriosclerosis; (c) Inflammatory bowel disease; (d) Insulin dependent diabetes 
mellitus (Type 1 diabetes); (e) Systemic lupus erythematosus; (f) Multiple sclerosis; (g) Psoriasis; 
(h) Allergic asthma; (i) Reperfusion injury; and (j) Stroke. 

In further preferred embodiment, the present invention provides for a method of using 
Tifapinix to treat patients with immune dysfunction or inflammopathology. Preferred 

15 compositions comprise Tifapinix. Further preferred compositions comprise Tifapinix. Preferred 
formulation of said composition is that formulation compatible with the route of delivery wherein 
said route of delivery is selected from, but not restricted to, the group: (a) Oral; (b) Transdermal; 
(c) Injection wherein injection is selected from, but not restricted to, the group consisting of: 
intravenous, intramuscular, subcutaneous, intra-synovial, and intra-tumoral; (d) Buccal; and 

20 (e) Aerosol. 

Neovascularization plays a role in the pathogenesis of a number of diseases, including but 
not restricted to rheumatoid arthritis [Danis RP et al., Expert Opin. Pharmacother. 2:395-407 
(2001) which disclosure is hereby incorporated by reference in its entirety]. 

In a further embodiment of the invention, said composition comprised of Tifapinix is used 

25 in a method of treating said patients with immune dysfunction or inflammopathology. Further 
preferred is a method of treating said patients in a method of ameliorating the symptoms or 
pathology associated with said immune dysfunction or inflammopathology. Further preferred is a 
method of treating said patients in a method of ameliorating the symptoms or pathology associated 
with pathogenetic engagement of the extrinsic coagulation pathway or the promotion of 

30 angiogenesis by TF. Further preferred are compositions comprised of Tifapinix used in methods of 
delivering to said patients an ameliorative effective amount of Tifapinix by said route of delivery. 
Further preferred is a method of delivering said composition comprising Tifapinix by said route of 
delivery to patients with immune dysfunction or inflammopathology wherein the immune 
dysfunction or inflammopathology is selected from, but not restricted to, the group: 

35 (a) Rheumatoid arthritis; (b) Atheriosclerosis; (c) Inflammatory bowel disease; (d) Insulin 
dependent diabetes mellitus (Type 1 diabetes); (e) Systemic lupus erythematosus; (f) Psoriasis; 
(g) Multiple sclerosis; (h) Allergic asthma; (i) Reperfusion injury; and (j) Stroke. 



185 



WO 02/094864 



PCT/IB01/01715 



In acute myocardial infarction (AMI), the monocyte TF procoagulant activity is increased 
and may contribute to the risk for recurrence and other thrombotic events [Ott I et al., Blood 
97:3721-6 (2001) which disclosure is hereby incorporated by reference in its entirety]. In a further 
embodiment of the invention, said composition comprised of Tifapinix is used in a method to treat 
5 patients with AM. Further preferred is a method of delivering by intravenous injection an 
ameliorative effective amount of Tifapinix in a method to treat patients with AMI. 

Studies confirm the important role of TF-mediated coagulation in the smooth muscle 
proliferation and neointimal thickening that follows vascular injury [Han X et al., Arterioscler. 
Thromb. Vase. Biol. 19:2563-7 (1999); Taubman MB et al., Thrombosis and Haemostasis 82:801- 

10 5 (1999) which disclosures are hereby incorporated by reference in their entirety]. In a further 
embodiment of the invention, said composition comprised of Tifapinix is used in a method to treat 
patients with neointimal thickening following vascular injury, including but not restricted to that 
consequential to balloon-induced vascular injury. Further preferred is a method of delivering by 
intravenous injection an ameliorative effective amount of Tifapinix in a method to treat patients 

15 with intimal thickening following vascular injury. 

Studies confirm the important role of TF engagement of the extrinsic coagulation pathway 
in vascular pathology. In a further embodiment of the invention, said composition comprised of 
Tifapinix is used in a method to treat patients with said TP-associated vascular pathology. Further 
preferred is a method of delivering by intravenous injection an ameliorative effective amount of 

20 Tifapinix in a method to treat patients with said vascular pathology. Further preferred is a method 
of delivering by intravenous injection an ameliorative effective amount of Tifapinix in a method to 
treat patients with said vascular pathology wherein said pathology is selected from, but not 
restricted to, fee group consisting of: (a) Disseminated intravascular coagulation (DIC); 
(b) Hypercoagulability; and (c) Septic shock. 

25 Proliferative diabetic retinopathy (PDR) remains one of the major causes of aquired 

blindness in developed nations. The hallmark of PDR is neovascularization, abnormal 
angiogenesis that may ultimately cause severe vitreous cavity bleeding and/or retinal detachment. 
In a further embodiment of the invention, said composition comprised of Tifapinix is used in a 
method to treat patients with said PDR. 

30 In a further embodiment of the invention, said composition comprised of Tifapinix is used 

in a method of anti-angiogenesis or anti-metastasis to treat patients with cancer. Further preferred 
is a method of treating said patients in a method of ameliorating the symptoms or pathology 
associated with said cancer. Further preferred are compositions comprised of Tifapinix used in 
methods of delivering to said patients an ameliorative effective amount of Tifapinix by said route 

35 of delivery. Further preferred is a method of delivering Tifapinix by said route of delivery to 
patients with cancer wherein the cancer is selected from, but not restricted to, the group: 
(a) Melanoma; (b) Breast carcinoma; (c) Lung carcinoma; (d) Colon carcinoma; (e) Prostatic 
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carcinoma; (f) Hodgkin's lymphoma; (g) Non-Hodgkin's lymphoma; (h) Pancreatic carcinoma; 
(i) Uterine carcinoma; (j) Ovarian carcinoma; (k) Testicular carcinoma; (1) Renal carcinoma; 
(m) Hepatic carcinoma; and (n) Lung non-small-cell carcinoma. 

Tifapinix represents a uniquely valuable reagent with which to address the molecular basis 
5 for the allosteric relationship between the initial FXa binding to Kunitz domain 2 and the 

subsequent TF-FVIIa binding to Kunitz domain 1 . That is because the lesion is Tifapinix is small 
and well-defined: a deletion of 13 contiguous amino acids from the linker region between Kunitz 
domains 1 and 2. Specifically, the relative importance of linker length and linker amino acid 
composition can be readily addressed. In further preferred embodiment, therefore, the present 

10 invention provides for a method of recombinant DNA manipulation of polynucleotides encoding 
Tifapinix to identify the critical molecular parameters for said allosteric mechanism. Methods of 
manipulating nucleic acid sequence, including but not restricted to site-specific mutagenesis, and 
expression of recombinant protein are well known to those skilled in the art. 

The capacity of Tifapinix to specifically inhibit the serine protease activity of FXa makes it 

15 a very useful reagent for assessing the role either of FXa serine protease activity or more generally 
that of the active site of FXa in a number of activities. These activities include but are not 
necessarily restricted to the group: 

Amplification of extrinsic coagulation, as read out in a clotting assay (Dialynas DP et al. 
Cellular Immunology 177:671-9 (1997) which disclosure is hereby incorporated by reference in its 

20 entirety; 

Serine proteolytic cleavage of specific substrate; and 

Docking with its receptor, EPR-1, expressed on.vascular endothelial cells and smooth 
muscle cells (Nicholson AC et al., J. Biol. Chem. 271:28407-13 (1996) which disclosure is hereby 
incorporated by reference in its entirety). 

25 Whereas (a) and (b) require the active site of FXa, (c) does not Tifapinix therefore would 

be a discriminating reagent with which to assess the involvement of FXa active site in diverse 
activities. For example, Tifapinix blocks (a) and (b), but does not block (c). In further preferred 
embodiment, the present invention provides for a method of using Tifapinix to investigate the 
requirement for FXa active site, and by inference FXa, in an activity manifested by a test sample. 

30 In further preferred embodiment, Tifapinix is used for plasmin binding and inhibition. 

Any suitable method may be used to test the compounds of this invention (US Patent 6,103,499, 
2000). Scatchard (Ann N.Y. Acad Sci (1949) 51:660-669) described a classical method of 
measuring and analyzing binding, which is applicable to protein binding. This method requires 
relatively pure protein and the ability to distinguish bound protein from unbound. A second 

35 appropriate method of measuring K.sub.D is to measure the inhibitory activity against the enzyme. 
If the K.sub.D to be measured is in the 1 nM to 1 muM range, this method requires chromogenic or 
fluorogenic substrates and tens of micrograms to milligrams of relatively pure inhibitor. For the 
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proteins of this invention, having K.sub.D in the range 5 nM to 50 pM, nanograms to micrograms 
of inhibitor suffice. When using this method, the competition between the inhibitor and the 
enzyme substrate can give a measured BLsub.i that is higher than the true K.sub.i. Measurement 
reported here is not so corrected because the correction would be very small and the any correction 
5 would reduce the K.sub.i. Here, we use the measured K.sub.i as a direct measure of KD. Tifapinix 
has a K.sub.D for plasmin of at most about 5nM, more preferably at most about 300 pM, and most 
preferably 100 pM or less. Preferably, the binding is inhibitory so that K.sub.i is the same as 
K.sub.D. The K.sub.i of QS4 for plasmin is about 2 nM. The K.sub.i of SPI1 1 for plasmin is about 
88 pM. 

10 In another preferred embodiment, Tifapinix is used for pharmaceutical methods and 

preparations. The preferred subject of this invention is a mammal. The invention is particularly 
useful in the treatment of humans, but is suitable for veterinary applications, too. Herein, 
"protection" includes "prevention", "suppression", and "treatment". "Prevention" involves 
administration of drug prior to the induction of disease. "Suppression" involves administration of 

15 drug prior to the clinical appearance of disease. "Treatment" involves administration of drug after 
the appearance of disease. In human and veterinary medicine, it may not be possible to distinguish 
between "preventing" and "suppressing" since the inductive event(s) may be unknown or latent, or 
the patient is not ascertained until after the occurrence of the inductive event(s). We use the term 
"prophylaxis" as distinct from "treatment" to encompass "preventing" and "suppressing". Herein, 

20 "protection" includes "prophylaxis". Protection need not by absolute to be useful. Tifapinix or 
fragments thereof may be administered, by any means, systemically or topically, to protect a 
subject against a disease or adverse condition. For example, administration of such a composition 
may. be by any parenteral route, by bolus injection or by gradual perfusion. Alternatively, or 
concurrently, administration may be by the oral route. A suitable regimen comprises 

25 administration of an effective amount of the protein, administered as a single dose or as several 
doses over a period of hours, days, months, or years. The suitable dosage of a protein of this 
invention may depend on the age, sex, health, and weight of the recipient, kind of concurrent 
treatment, if any, frequency of treatment, and the desired effect. However, the most preferred 
dosage can be tailored to the individual subject, as is understood and determinable by one of skill 

30 in the art, without undue experimentation by adjustment of the dose in ways known in the art. For 
methods of preclinical and clinical testing of drugs, including proteins, see, e.g., Berkow el al, eds., 
The Merck Manual, 15 th edition, Merck and Co., Rahway, N.J., 1987; Goodman et al, eds., 
Goodman and Oilman's The Pharmacological Basis of Therapeutics, 8th edition, Pergamon Press, 
Inc., Elmsford, N.Y., (1990); Avery's Drug Treatment: Principles and Practice of Clinical 

35 Pharmacology and Therapeutics, 3rd edition, ADIS Press, LTD., Williams and Wilkins, Baltimore, 
Md. (1987), Ebadi, Pharmacology, Little, Brown and Co., Boston, (1985), which references are 
hereby incorporated in their entirety. In addition to Tifapinix, a pharmaceutical composition may 
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contain pharmaceutically acceptable carriers, excipients, or auxiliaries. See, e.g., Berker, supra, 
Goodman, supra, Avery, supra and Ebadi, supra. 

In yet another preferred embodiment, Tifapinix or fragments thereof are used for in vitro 
diagnostic methods and reagents. Tifapinix and related sequences may be applied in vitro to any 
5 suitable sample that might contain plasmin to measure the plasmin present. The assay must 

include a Signal Producing System (SPS) providing a detectable signal that depends on the amount 
of plasmin present. The signal may be detected visually or instrumentally. Possible signals 
include production of colored, fluorescent, or luminescent products, alteration of the characteristics 
of absorption or emission of radiation by an assay component or product, and precipitation or 

10 agglutination of a component or product. The component of the SPS most intimately associated 
with the diagnostic reagent is called the "label". A label may be, e.g., a radioisotope, a fluorophore, 
an enzyme, a co-enzyme, an enzyme substrate, an electron-dense compound, or an agglutinable 
particle. A radioactive isotope can be detected by use of, for example, a .gamma, counter or a 
scintillation counter or by autoradiography. Isotopes which are particularly useful are .sup.3 H, 

15 .sup.125 I, .sup.131 1, .sup.35 S, .sup.14 C, and, preferably, .sup.125 I. It is also possible to label a 
compound with a fluorescent compound. When the fluorescent-labeled compound is exposed to 
light of the proper wavelength, its presence can be detected. Among the most commonly used 
fluorescent labeling compounds are fluorescein isothiocyanate, rhodamine, phycoerythrin, 
phycocyanin, allophycocyanin, o-phthaldehyde, and fluorescamine. Alternatively, fluorescence- 

20 emitting metals, such as .sup.125 Eu or other anthanide, may be attached to the binding protein 
using such metal chelating groups as diethylenetriaminepentaacetic acid or ethylenediamine- 
tetraacetic acid. The proteins also can be detectably labeled by coupling to a chemiluminescent 
compound, such as luminol, isolumino, theromatic acridinium ester, imidazole, acridinium salt, 
and oxalate ester. Likewise, a bioluminescent compound, such as luciferin, luciferase and 

25 aequorin, may be used to label the binding protein. The presence of a bioluminescent protein is 
determined by detecting the presence of luminescence. Enzyme labels, such as horseradish 
peroxidase and alkaline phosphatase, are preferred. There are two basic types of assays: 
heterogeneous and homogeneous. In heterogeneous assays, binding of the affinity molecule to 
analyte does not affect the label; thus, to determine the amount of analyte, bound label must be 

30 separated from free label. In homogeneous assays, the interaction does affect the activity of the 
label, and analyte can be measured without separation. Tifapinix, as a plasmin-binding protein 
may be used diagnostically in the same way that an antiplasmin antibody is used. Thus, depending 
on the assay format, it may be used to assay plasmin, or, by competitive inhibition, other 
substances which bind plasmin. The sample will normally be a biological fluid, such as blood, 

35 urine, lymph, semen, milk, or cerebrospinal fluid, or a derivative thereof, or a biological tissue, 
e.g., a tissue section or homogenate. If the sample is a biological fluid or tissue, it may be taken 
from a human or other mammal, vertebrate or animal, or from a plant. The preferred sample is 
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blood, or a fraction or derivative thereof. In a related embodiment, Tifapinix or fragments thereof 
is immobilized, and plasmin in the sample is allowed to compete with a known quantity of a 
labeled or specifically labelable plasmin analogue. The "plasmin analogue" is a molecule capable 
of competing with plasmin for binding to Tifapinix or fragments thereof. It may be labeled 
5 already, or it may be labeled subsequently by specifically binding the label to a moiety 
differentiating the plasmin analogue from plasmin. The phases are separated, and the labeled 
plasmin analogue in one phase is quantified. In a "sandwich assay", both an insolubilized plasmin- 
binding agent (PBA), and a labeled PB A are employed. The plasmin analyte is captured by the 
insolubilized PBA and is tagged by the labeled PBA, forming a tertiary complex. The reagents 

10 may be added to the sample in any order. The PBAs may be the same or different, and only one 
PBA needs to comprise Tifapinix or fragments thereof according to this invention (the other may 
be, e.g., an antibody). The amount of labeled PBA in the tertiary complex is directly proportional 
to the amount of plasmin in the sample. The two embodiments described above are both 
heterogeneous assays. A homogeneous assay requires only that the label beaffected by the binding 

15 of Tifapinix or fragments thereof to plasmin. The plasmin analyte may act as its own label if 
Tifapinix or fragments thereof are used as a diagnostic reagent. A label may be conjugated, 
directly or indirectly (e.g., through a labeled anti-Tifapinix antibody), covalently (e.g., with SPDP) 
or noncovalently, to the plasmin-binding protein, to produce a diagnostic reagent. Similarly, the 
plasrnin-binding protein may be conjugated to a solid phase support to form a solid phase 

20 ("capture") diagnostic reagent. Suitable supports include glass, polystyrene, polypropylene, 
polyethylene, dextran, nylon, amylases, and magnetite. The carrier can be soluble to some extent 
or insoluble for the purposes of this invention. The support material may have any structure so 
long as the coupled molecule is capable of binding plasmin. 

In yet another preferred embodiment, Tifapinix or fragments thereof are used for in vivo 

25 diagnostic uses. Tifapinix or fragments thereof, i.e. a Kunitz domain that binds very tightly to 
plasmin can be used for in vivo imaging. Radiolabeled Tifapinix may be administered to a human 
or animal subject, typically by injection, e.g., intravenous or arterial other means of administration 
such as subcutaneous, intramuscular in a quantity sufficient to permit subsequent dynamic and/or 
static imaging using suitable radio-detecting devices. The dosage is the smallest amount capable 

30 of providing a diagnostically effective image, and may be determined by means conventional in the 
art, using known radio-imaging agents as guides. Typically, the imaging is carried out on the 
whole body of the subject, or on that portion of the body or organ relevant to the condition or 
disease under study. The radiolabeled binding protein has accumulated. The amount of 
radiolabeled binding protein accumulated at a given point in time in relevant target organs can then 

35 be quantified. A particularly suitable radio-detecting device is a scintillation camera, such as a. 
gamma, camera. The detection device in the camera senses and records (and optional digitizes) the 
radioactive decay. Digitized information can be analyzed in any suitable way, many of which are 
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known in the art. For example, a time-activity analysis can illustrate uptake through clearance of 
the radiolabeled binding protein by the target organs with time. The radioisotope used should 
preferably be pharmacologically inert, and the quantities administered should not have substantial 
physiological effect. The binding protein may be radio-labeled with different isotopes of iodine, 
5 for example .sup.123 I, .sup.125 1, or .sup.131 1 (see, for example, U.S. Pat. No. 4,609,725). The 
amount of labeling must be suitably monitored. 

In applications to human subjects, it may be desirable to use radioisotopes other than. 
sup.125 I for labeling to decrease the total dosimetry exposure of the body and to optimize the 
detectability of the labeled molecule. Considering ready clinical availability for use in humans, 

10 preferred radio-labels include: .sup.99m Tc, .sup.67 Ga, .sup.68 Ga .sup.90 Y, .sup. 1 1 1 In, 
.sup.H3m In, .sup.123 I, .sup.186 Re, .sup.188 Re, or .sup.211 At. Radiolabeled protein maybe 
prepared by various methods. These include radio-halogenation by the chloramine-T or 
lactoperoxidase method and subsequent purification by high pressure liquid chromatography, for 
example, see Gutkowska et al in "Endocrinology and Metabolism Clinics of America 16 (1):183, 

15 1987. Other methods of radiolabeling can be used, such as IODOBEADS.TM. Tifapinixor 

fragments thereof may also be used to purify plasmin from a fluid, e.g., blood. For this purpose, it 
is preferably immobilized on an insoluble support. Such supports include those also useful in 
preparing solid phase diagnostic reagents. Proteins can be used as molecular weight markers for 
reference in the separation or purification of proteins. 

20 These embodiments also relate to isolation, purification and production of antibodies 

wherein antibodies can be polyclonal or monoclonal as described (US Patent 6,171,587 Bl, 2000), 
hereby enclosed in their entirety. 

Another preferred embodiment relates to the use of Tifapinix and Kunitz domains thereof 
for the inhibition of kallikrein activity. Kallikreins are serine proteases found in both tissues and 

25 plasma (see US Patent 5,994, 125, 1 999, US Patent 6,057,287, 2000) which references are hereby 
enclosed in their entirety). Plasma kallikrein is involved in contact-activated coagulation, 
fibrinolysis, hypotension, and inflammation mediated through the activities of factor XII 
(coagulation), pro-urokinase/plasminogen (fibrinolysis), and kininogens (hypotension and 
inflammation). Kallikrein cleavage of kininogens results in the production of highly potent 

30 bioactive peptides (kinins), which cause increased vascular permeability, vasodilation, 

bronchospasm, and pain induction. Thus, kinins mediate life-threatening vascular shock and 
edema associated with bacteremia (sepsis) or trauma, asthma, and inflammatory and neurogenic 
pain associated with tissue injury, and edema in CI -inhibitor-deficient diseases (hereditary 
angioedema). Tifapinix, as a protease inhibitor, and fragments thereof said Kunitz domains, 

35 prevent the cleavage of kallikrein and thus the release of said kinins. 

Tifapinix may be used for any of the foregoing purposes. Methods for production using 
eukaryotic and prokaryotic expression systems have been reported previously and are well known 
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in the art (US Patent 6,103,500, 2000; PCT WO 95/18830). For example, Tifapinix or fragments 
thereof, whereas preferred fragments comprise said Kunitz domains, preferably Kunitz domain 
three, may be produced by any conventional technique including (i) nonbiological synthesis by 
sequential coupling of component amino acids, (ii) production by recombinant DNA techniques in 
5 a suitable host cell such as bacterial, insect- or mammalian cells, (iii) removal of undesired 
sequences from LACI and in coupling of synthetic replacement sequences (US patent 5,994,125, 
1999, hereby incorporated in its entirety). 

Protein of SEQ ID NO:50 (Internal designation Clone 588098 J84-11-4-0-H4-F) 

The cDNA of Clone 588098_184-1 1-4-0-H4-F (SEQ ID NO:49) encodes the protein of 

10 SEQ ID NO: 50 comprising the amino acid sequence 

MPSSVSWGD^LLAGLCCLWVSLGTKLADTHDEILEGL 
PDSQLQLTTGNGIJFLSEGLKLV^ 
QGKWDLVKELDRDTWALVN^ 
GMFOTQHCKKLSSWVLLMKY^ 

15 LHLPKLSrTGTYDLKSVL^ 

AAGAMFLEAffMSIPPEVKH^JKPF Accordingly it will 

be appreciated that all characteristics and uses of polypeptides of SEQ ID NO:50 described 
throughout the present application also pertain to the polypeptides encoded by the nucleic acids 
included in Clone 588098J 84-1 1-4-0-H4-F. In addition, it will be appreciated that all 

20 characteristics and uses of the polynucleotides of SEQ ID NO:49 described throughout the present 
application also pertain to the nucleic acids included in Clone 588098_184-1 1-4-0-H4-F. A 
preferred embodiment of the invention is directed toward the compositions of SEQ ID NO:49, 
SEQ ID NO:50, and Clone 588098_184-1 1-4-0-H4-F. Also preferred are polypeptide fragments 
having a biological activity as described herein and the polynucleotides encoding the fragments. 

25 The protein of SEQ ID NO:50 encodes CrypAAT, a splice variant of alpha- 1 -antitrypsin 

(antitrypsin) with novel function. In CrypAAT, internal splicing within exon 2 leaves the signal 
sequence intact but results in an N-terminal deletion of 67 amino acids from the mature protein. 
This deletion extends from the disordered N-terminus through helix A and into helix B (Stein, PE 
et al., Nature Structural Biology 2:96-1 13 (1995) which disclosure is hereby incorporated by 

30 reference in its entirety). The Met-Ser active site near the C-terminus is intact. 

Antitrypsin is synthesized primarily by hepatocytes and is the most abundant proteinase 
inhibitor in human plasma. Although it diffuses through all organs, and inhibits a large number of 
proteases, its primary function is in the lung parenchyma, where it protects alveolar tissue from 
damage by neutrophil elastase, a serine protease released in the course of an inflammatory 

35 response. Elastases are defined by their ability to cleave elastin, the matrix protein that gives 
tissues the property of elasticity. If left uncontrolled, neutrophil elastase leads to excessive 
inflammation and progressive emphysema. Individuals with antitrypsin deficiency have at least a 
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20-fold increase risk of developing emphysema. 

Antitrypsin is a member of the serpin (serine protease inhibitor) supergene family. The 
primary function of most of the serpins is the regulation of proteolytic enzymes under both 
physiological and pathological conditions. On the basis of strong sequence similarities, a number 
5 of proteins with no known inhibitory activity have been classified as serpins. For example, 

thyroxine binding globulin (TBG) and corticosteroid binding globulin (CBG) serve as transporters 
of lipophilic hormones, and angiotensinogen is a peptide hormone precursor (Janciauskiene, S, 
Biochimica et Biophysica Acta.l535:221-35 (2001) which disclosure is hereby incorporated by 
reference in its entirety). 

10 Serpins are competitive, irreversible inhibitors of serine proteases. Serpins have a common 
molecular design based on a five-stranded beta-sheet A and the reactive loop arising from it, that 
presents a peptide sequence to the target proteinase. The function of antitrypsin as a proteinase 
inhibitor depends on its undergoing conformational change when it binds to neutrophil elastase. 
This change involves the insertion of the cleaved reactive loop as the 4 th strand in its beta-sheet A, 

1 5 and deactivates neutrophil elastase by swinging it from the top to the bottom of the antitrypsin 
molecule (described as a mousetrap action) (Parmar, JS et al., Journal of the Royal College of 
Physicians of London 34:295-300 (2000) which disclosure is hereby incorporated by reference in 
its entirety). A complex 'shutter' domain is responsible for maintaining the usual, closed state of 
beta-sheet A (Stein, PE et al., Nature Structural Biology 2:96-1 13 (1995); Gils, A et al. Thromb. 

20 Haemost. 80:53 1-41 (1 998) which disclosures are hereby incorporated by reference in their 
entirety). 

By virtue of conformational perturbation imposed on the protein by the novel splicing 
event, CrypAAT is without proteinase inhibitory function. CrypAAT retains its susceptibility to 
cleavage by neutrophil elastase, however. It also retains its susceptibility to cleavage by a number 

25 of non-target proteinases, such as gelatinase B (MMP-9). Unlike antitrypsin, therefore, CrypAAT 
functions as a proteinase substrate. Cleavage of CrypAAT by neutrophil elastase and the non- 
target proteinases generates a 4 kDa C-terminal fragment of 36 residues, which on cleavage 
remains non-covalently bound to the cleaved CrypAAT. 

CrypAAT plays a number of diverse physiological roles as a proteinase substrate 

30 (Janciauskiene, S, Biochimica et Biophysica Acta 1535:221-35 (2001) which disclosure is herebly • 
incorporated by reference in its entirety). Cleaved CrypAAT contributes to the later phase of 
polymorphonuclear leukocyte infiltration and is a potent chemoattractant for monocytes. The 
isolated C-terminal fragment of CrypAAT can associate with extracellular matrix proteins such as 
collagen and/or laminin-1 and, in so doing, play an important role in protecting these proteins from 

35 inappropriate enzyme digestion. The C-terminal fragment of CrypAAT also exerts significant 
effects on cellular lipid catabolism and proinflammatory activation, by activating peroxisome 
proliferator-activated receptors (PPARs), transcription factors that recently have been proposed to 



193 



WO 02/094864 



PCT/IB01/01715 



regulate genes for lipid metabolism and proinflammatory proteins. 

CrypAAT has also been implicated in several pathologies, namely atherosclerosis and 
cancer. The C-terminal cleavage fragment of CrypAAT is a component of atherosclerotic plaque, 
located specifically in the fibrous cap near the necrotic core. CrypAAT plays a role in 
5 atherosclerosis as a protease substrate and a reservoir of physiologically active peptide degradation 
products. CrypAAT-positive adenocarcinomas of colon and lung have a worse prognosis than 
CrypAAT-negative ones. Recent studies provide good experimental evidence that the C-terminal 
fragment of CrypAAT generated by matrix metalloproteinases (MMPs) enhances tumor growth 
and invasiveness in vivo. 

10 Contrary to previous dogmas, it is now well established that brain cells can produce 

cytokines and chemokines, and can express adhesion molecules than enable an in situ 
inflammatory reaction. Brain ischemia and trauma elicit robust inflammation. The accumulation 
of neutrophils early after brain injury is believed to contribute to the degree of brain tissue loss. 

In a preferred embodiment, the present invention provides for an antibody that specifically 

15 binds CrypAAT of the present invention. Further preferred is a method for making said antibody 
wherein said antibody recognizes a non-conformational or conformational epitope of CrypAAT. 

Further preferred is a method for making said antibody wherein a mouse is immunized 
with CrypAAT. Further preferred is a method wherein monoclonal antibodies derived from said 
mouse are screened for binding to CrypAAT but not to antitrypsin. Further preferred is a method 

20 wherein monoclonal antibodies derived from said mouse are screened by enzyme-linked 

immunosorbent assay (ELISA) for binding to CrypAAT but not to antitrypsin. Further preferred is 
a method wherein said antibody is screened for the capacity to sterically or allosterically abrogate 
the protease susceptibility of CrypAAT. Further preferred is a method wherein said antibody is 
screened for the capacity to sterically or allosterically abrogate the neutrophil elastase or gelatinase 

25 susceptibility of CrypAAT. Methods of generating said monoclonal antibody and of establishing 
its specificity by methods including ELISA are well known to those skilled in the art. Methods of 
screening said antibody for the capacity to abrogate the protease susceptibility of CrypAAT are 
well known to those skilled in the art and include, but are not limited to: contacting the antibody 
with CrypAAT, incubating the antibody-CrypAAT complex with neutrophil elastase or gelatinase, 

30 and following proteolytic generation of the 4 kDa carboxyl fragment by denaturing polyacrylamide 
gel electrophoresis. 

hi a preferred embodiment, the present invention provides for a method of contacting said 
antibody and specifically binding it with CrypAAT. Further preferred is a method for using said 
antibody diagnostically to determine the basis either for immune dysfunction or for 
35 inflammopathology. Further preferred is a method of using said antibody diagnostically in a 

sandwich ELISA format to quantitate CrypAAT in plasma or other bodily fluid, including but not 
restricted to synovial fluid and cerebrospinal fluid, within a pathological context. Further 
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preferred is a method of using said diagnostic assay to determine the level of CrypAAT in plasma 
or other bodily fluid of a patient with either dysregulated immune function or inflammopathology 
wherein the immune dysfunction or inflammopathology is selected from, but not restricted to, the 
group consisting of: (a) Rheumatoid arthritis; (b) Atheriosclerosis; (c) Inflammatory bowel 
5 disease; (d) Insulin dependent diabetes mellitus (Type 1 diabetes); (e) Systemic lupus 

erythematosus; (f) Multiple sclerosis; (g) Psoriasis; (h) Allergic asthma; (i) Acute myocardial 
infarction; (j) Septic shock; (k) Reperfusion injury; and (1) Stroke. 

In further preferred embodiment, the present invention provides for a method of contacting 
and specifically binding to CrypAAT said antibody having the capacity to abrogate the proteolytic 

1 0 susceptibility, including but not restricted to that of neutrophil elastase and gelatinase, of 

CrypAAT. Further preferred is a method of using said antibody in contact wife CrypAAT as a 
therapeutic for patients with either immune dysfunction or inflammopathology. Preferred 
compositions comprise said CrypAAT antibody or fragments or derivatives thereof. Preferred 
formulation of said composition is that compatible with the route of delivery wherein said route of 

15 delivery is selected from, but not restricted to, the group: (a) Oral; (b) Transdermal; (c) Injection; • 
(d) Buccal; and (e) Aerosol. 

In further preferred embodiment, the present invention provides for a method of contacting 
and specifically binding to CrypAAT said antibody having the capacity to abrogate the proteolytic 
■susceptibility, including but not restricted to that of neutrophil elastase and gelatinase, 

20 susceptibility of CrypAAT. Further preferred is a method for using said CrypAAT antibody to 
treat patients with immune dysfunction or inflammopathology. Further preferred is a method of 
treating said patients with said CrypAAT antibody in a method of ameliorating the symptoms or 
pathology associated with immune dysfunction or inflammopathology. Said CrypAAT antibody 
ameliorates the symptoms or pathology associated with immune dysfunction or 

25 inflammopathology by suppressing proteolytic generation of bioactive fragments of CrypAAT, 
including but not restricted to the 4 kDa carboxyl fragment. Further preferred is a method of 
delivering to said patients an ameliorative effective amount of said CrypAAT antibody. Further 
preferred is a method of delivering to said patients an ameliorative effective amount of said 
CrypAAT antibody by injection. Further preferred is a method of delivering to said patients with 

30 immune dysfunction or inflammopathology an ameliorative effective amount of said CrypAAT 
antibody wherein said immune dysfunction or inflammopathology is selected from, but not 
restricted to, the group: (a) Rheumatoid arthritis; (b) Atheriosclerosis; (c) Inflammatory bowel 
disease; (d) Insulin dependent diabetes mellitus (Type 1 diabetes); (e) Systemic lupus 
erythematosus; (f) Psoriasis; (g) Multiple sclerosis; (h) Allergic asthma; (i) Acute myocardial 

35 infarction; (j) Septic shock; (k) Reperfusion injury; and (1) Stroke. 

. Further preferred is a method of contacting and specifically binding said antibody with 
CrypAAT in a method of transdermal contact to ameliorate the symptoms or pathology of 
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psoriasis. Further preferred are compositions comprised of said CrypAAT antibody used in 
methods of contacting the psoriatic lesion with an ameliorative effective amount of said CrypAAT 
antibody by injection or transdermal contact at the site of the lesion. 

Further preferred is a method of contacting and specifically binding said antibody with 
5 CrypAAT in a method to ameliorate the symptoms or pathology of allergic asthma. Preferred 
route of delivery is aerosol. Further preferred are compositions comprised of said CrypAAT 
antibody used in methods of contacting asthmatic tissue with an ameliorative effective amount of 
said CrypAAT antibody by aerosol. 

Further preferred is a method of contacting and specifically binding said antibody with 

10 CrypAAT in a method to ameliorate the symptoms or pathology of allergic rhinitis (hayfever). 
Preferred route of delivery is aerosol. Further preferred are compositions comprised of said 
CrypAAT antibody used in methods of contacting inflamed nasal tissue with an ameliorative 
effective amount of said CrypAAT antibody by aerosol. 

In a further embodiment, the present invention provides for said CrypAAT antibody to be 

1 5 used in a method to suppress acute inflammation. Further preferred is a method to use said 

CrypAAT antibody to suppress inflammation associated with wound healing. Further preferred are 
compositions comprised of said antibody used in methods of contacting a wound or injured tissue 
with an ameliorative effective amount by injection or transdermal contact at the site of the wound. 
In further preferred embodiment, the present invention provides for a method of contacting 

20 and specifically binding to CrypAAT said antibody having the capacity to abrogate the proteolytic 
susceptibility, including but not restricted to that of neutrophil elastase and gelatinase, 
susceptibility of CrypAAT. Further preferred is a method of treating cancer patients with said 
CrypAAT antibody in a method of ameliorating the symptoms or pathology associated with the 
cancer. Said CrypAAT antibody ameliorates the symptoms or pathology associated with cancer 

25 (including but not restricted to metastasis and invasiveness) by suppressing proteolytic generation 
of bioactive fragments of CrypAAT, including but not restricted to the 4 kDa carboxyl fragment. 
Further preferred is a method of delivering to said patients an ameliorative effective amount of said 
CrypAAT antibody by said route of delivery. Preferred route of delivery is intravenous or intra- 
tumoral injection. Further preferred is a method of delivering to said patients with cancer an 

30 ameliorative effective amount of said CrypAAT antibody wherein said cancer is selected from, but 
not restricted to, the group: (a) Melanoma; (b) Breast carcinoma; (c) Lung carcinoma; (d) Colon 
carcinoma; (e) Hodgkin's lymphoma; (f) Non-Hodgkin's lymphoma; (g) Prostatic 
carcinoma; (h) Pancreatic carcinoma; (i) Uterine carcinoma; (j) Ovarian carcinoma; (k) Testicular 
carcinoma; (1) Renal carcinoma; (m) Hepatic carcinoma; and (n) Lung non-small-cell carcinoma. 

35 In a further embodiment, the present invention provides for said CrypAAT antibody to be 

used in a method of preclinical pharmacology in animal models of disease, including but not 
restricted to those of immune dysfunction, inflammopathology, and cancer. 
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Further preferred is a method in which said CrypAAT antibody is used in a rodent or 
primate model of human immune dysfunction or inflammopathology to optimize the therapeutic 
efficacy of said CrypAAT antibody. Further preferred is a method in which said CrypAAT 
antibody is used in a rodent or primate model of human immune dysfunction or 
5 inflammopathology wherein said immune dysfunction or inflammopathology is selected from but 
not restricted to the group: (a) Rheumatoid arthritis; Atheriosclerosis; Inflammatory bowel 
disease; Insulin dependent diabetes (Type 1 diabetes); Systemic lupus erythematosus; Psoriasis; 
Multiple sclerosis; Allergic asthma; Acute myocardial infarction; Septic shock; Reperfusion injury; 
and Stroke. 

10 Further preferred is a method in which said CrypAAT antibody is used in a mouse model 

of human cancer to optimize the therapeutic efficacy of said CrypAAT antibody. Further preferred 
is a method in which said CrypAAT antibody is used in a xenogeneic mouse model of human 
leukemia. Preferred route of delivering said composition comprised of CrypAAT antibody 
includes but is not restricted to intravenous injection and implanted pump. Further preferred is a 

1 5 method in which said CrypAAT antibody is used in a xenogeneic mouse model of human leukemia 
engrafted with primary leukemia cells obtained from patients (Dialynas, DP et aL, Blood 97:3218- 
25 (2001) which disclosure is hereby incorporated by reference in its entirety) wherein the 
leukemia is selected from but not restricted to the group: (a) Childhood T lymphocyte acute 
lymphoblastic leukemia (Pediatric T-ALL); (b) Adult T lymphocyte acute lymphoblastic leukemia 

20 (Adult T-ALL); (c) B lymphocyte acute lymphoblastic leukemia (B-ALL); (d) Acute myeloid 
leukemia (AML); (e) Chronic lymphocytic leukemia (CLL); and (f) Multiple myeloma. 

In a further embodiment, the present invention provides for the use of said CrypAAT 
antibody in a method to abrogate proteolytic generation of the bioactive 4 kDa fragment of 
CrypAAT in in vitro cell cultures using human serum. Further preferred is a method of contacting 

25 and specifically binding said CrypAAT antibody to CrypAAT in culture.to block in situ proteolytic 
generation of the bioactive 4 kDa carboxyl fragment from CrypAAT introduced into culture by 
human serum. Further preferred is a method of contacting and specifically binding said CrypAAT 
antibody immobilized on a resin to CrypAAT to deplete CrypAAT from human serum samples by 
immunoaffinity chromatography. 

30 hi a further embodiment, the present invention provides for the screening of test 

compounds for the capacity to specifically bind to CrypAAT and block the proteolytic generation 
of the 4 kDa carboxyl fragment by proteases including but not restricted to neutrophil elastase and 
gelatinase. Further preferred are said test compounds that specifically bind to either a non- 
conformational or conformational site on CrypAAT. Further preferred are said test compounds 

35 that block said proteolytic cleavage of CrypAAT either sterically or allosterically. Further 

preferred is a method of screening said test compounds for the capacity to block cleavage within 
the active site of CrypAAT by neutrophil eleastase or non-target proteinase and so generate the 4 
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kDa carboxyl fragment. Methods of screening said test compound for the capacity to abrogate the 
protease susceptibility of CrypAAT are well known to those skilled in the art and include, but are 
not limited to: contacting the test compound with CrypAAT, incubating the test compound- 
CrypAAT complex with neutrophil elastase or gelatinase, and following proteolytic generation of 
5 the 4 kDa carboxyl fragment by denaturing polyacrylamide gel electrophoresis. 

Preferred formulations of said compound are those selected from, but not restricted to, 
formulations amenable to the routes of delivery selected from the group: (a) Oral; 
(b) Transdermal; (c) Injection; (d) Buccal; and (e) Aerosol. 

Compounds found to block the cleavage of CrypAAT within its active site by elastase or 
10 non-target proteinase are used in in vivo and in vitro methods analogous to those described above 
for CrypAAT antibody. 

Protein of SEQ ID NO:54 (Internal designation Clone 789749_182-14-3-0-C12-F) 

The cDNA of clone 789749J 82-14-3-0-C12-F (SEQ ID NO:53) encodes the protein of 
SEQ ID NO:54 comprising the amino acid sequence: 
15 MHFCGGTLISPEWVLTAAHCLEKSP 
DLALLKLSSPAVITDKVIPACL^ 

KVCNRYEFLNGRVQSTELCAGHLAGGTDSCQGDSGGPLVCFEKDKYI^ 
RPNKPGVYWVSRFVTWIEGVMRNN. 

Accordingly it will be appreciated that all characteristics and uses of polypeptides of SEQ ID 
20 NO:54 described throughout the present application also pertain to the polypeptides encoded by the 

nucleic acids included in Clone 789749__182-14-3-0-C12-F. In addition, it will be appreciated that 

all characteristics and uses of the polynucleotides of SEQ ID NO:53 described throughout the 

present application also pertain to the nucleic acids included in Clone 789749__182-14-3-0-C12-F. 

Also preferred are fragments having a biological activity as described therein and the 
25 polynucleotides encoding the fragments. 

The protein of SEQ ID NO:54 encodes Plasminute, a variant of plasmin resulting from 

alternative transcription initiation within the plasminogen gene. Plasminute has novel function as 

described below. 

The terminal event in activation of the human fibrinolytic system is generation of the 
30 enzyme plasmin, a serine protease possessing a variety of functional properties, the most notable of 
which is clearance by proteolytic degradation of fibrin deposits. Plasmin is formed upon activation 
of its zymogen, plasminogen, as a result of cleavage of a single peptide bond. This latter event is 
catalyzed by serine proteases with narrow specificity, termed plasminogen activators. Urokinase- 
type plasminogen activator and tissue-type plasminogen activator act directly; streptokinase acts 
35 indirectly. In its capacity as a serine protease, plasmin functions to dissolve the fibrin clot. Each 
of these plasminogen activators is commercially available and is indicated for the treatment of 
acute vascular diseases such a myocardial infarct, stroke, pulmonary embolism, deep vein 
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thrombosis, peripheral arterial occlusion, and other venous thromboses (US Patent 5,753,486; 
"Human tissue plasminogen activator;" which disclosure is hereby incorporated by reference in its 
entirety). Collectively, these diseases account for major health hazards and risks. 

In its capacity as a serine protease, plasmin also plays a role in normal processes involving 
5 cell migration in tissue remodeling. In this regard, plasmin is believed to function in processes in 
which cell movement is essential, such as macrophage invasion in inflammation, angiogenesis, and 
keratinocyte accumulation after wound healing. Furthermore, plasmin has also been strongly 
implicated as an important mediator in pathological processes of cell migration that are involved in 
tumor cell growth (plasmin can activate growth factors) and invasion of surrounding tissue and, 

10 perhaps, metastases. Involvement of plasmin in these latter processes is supported by the ability of 
plasmin to degrade extracellular matrix proteins directly, such as proteoglycans, fibronectin, 
laminin, and type IV collagen, and/or be indirectly responsible for the degradation of matrix 
proteins through activation of metalloprotease zymogens, such as stromolysin andprocollagenase. 
As a result of degradation of the extracellular matrix, cell migration into surrounding areas 

1 5 becomes more facile (Castellino, F J in Molecular Basis of Tlirombosis and Hemostasis, High, KA 
& Roberts, HR, editors, New York, pp 495-5 15 (1995) which disclosure is hereby incorporated by 
reference in its entirety). 

.Plasminogen is synthesized by endothelial cells as an 810-residue single chain 
glycoprotein, from which is excised a 19-residue signal peptide during secretion. Plasminogen is 

20 converted to plasmin as a result of activator-catalyzed cleavage of the Arg561-Val562 peptide 
bond (numbered from the ammo-terminal glutamic acid residue of secreted plasminogen). The 
resulting plasmin contains a heavy chain, originating from the ammo-terminus of plasminogen, 
doubly disulfide-linked to a light chain. This latter region, containing the carboxy-terminus of 
plasminogen, is homologous to serine proteases such as trypsin and elastase. The heavy chain of 

25 plasmin consists of five repeating triple-disulfide-linked peptide regions, about 80 amino acid 
residues in length, termined kringles, that are responsible in part for interactions of plasmin with 
inhibitors (Castellino, FJ et al., Ciba Found. Symp. 212:46-60 (1997) which disclosure is hereby 
incorporated by reference in its entirety). 

The gene for human plasminogen spans about 52.5 kilobases of DNA and consists of 19 

30 exons separated by 1 8 introns (Petersen, TE et al., J. Biol. Chem. 265:6104-1 1 (1990) which 
disclosure is hereby incorporated by reference in its entirety). 

Plasminogens exist in about three different forms in solution, as determined by laser light 
scattering experiments and NMR. Three truncated forms of plasminogen have been described. 
Removal of the amino-terminal domain yields a shorter proenzyme (Lys-plasminogen) that is more 

35 efficiently activated than the parent (Glu-plasminogen). Further cleavage using the enzyme 

elastase removes the amino-terminal domain and four kringle domains leaving miniplasminogen. 
Miniplasminogen is activatable by urokinase to the enzyme miniplasmin with fibrinolytic activity 
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equivalent to that of plasmin. The most striking functional difference of miniplasmin is its relative 
resistance to inhibition by the primary plasmin inhibitor, alpha-2-antiplasmin, probably reflecting 
the absence of kringle domain 1, which is thought to facilitate primary interaction of plasmin with 
the inhibitor (Moroz, LA, Blood, 58:97-104 (1981) which disclosure is hereby incorporated by 
5 reference in its entirety). 

A functionally active human microplasminogen without kringle structures was produced 
by incubation of plasminogen with urokinase-free plasmin at alkaline pH. Microplasminogen can 
be activated by urokinase and streptokinase to catalytically active microplasmin. Microplasmin 
consists of two polypeptide chains connected by disulfide bonds: one is the intact light chain, and 
10 the other is a peptide of 3 1 residues from the carboxyl-tenninal portion of the heavy chain (Shi, G- 
Y et al., J. Biol Chem. 263:17071-5 (1988) which disclosure is hereby incorporated by reference 
in its entirety). 

It is significant that the formation of plasminogen fragments such as miniplasmhiogen-like 
molecules has been observed under some pathophysiological conditions. Of particular note is a 

15 report that synovial fluid in acute inflammatory arthritis (including rheumatoid arthritis), unlike 
that of acute non-inflammatory arthritis (including osteoarthritis), contains low molecular weight 
fragments of plasminogen with the properties of miniplasminogen (Moroz, LA et al., Thrombosis 
Research 43:417-24 (1986) which disclosure is hereby incorporated by reference in its entirety). 
Whether neutrophil elastase or other mechanisms are responsible for their generation, the presence 

20 in inflamed joints of molecules with properties of miniplasminogen indicates a potential for their 
participation in inflammatory events where plasmin activity has been implicated, as in the 
activation of procollagenase to collagenase in rheumatoid synovium, but where the inhibitory 
. activity of alpha-2-antiplasmin has been invoked as an obstacle to such a view. However, the 
ability of molecules such as miniplasmin to escape such inhibition suggests the possibility that 

25 generation of miniplasmin might lead to activation of procollagenase, or destroy joint structural 
proteins directly. 

Plasminute is the product of alternative transcription initiation within the plasminogen 
gene. Transcription initiates within intron N (at least 1 03 6 nucleotides upstream of exon XV) and 
proceeds through the remainder of the plasminogen gene (Petersen, TE et al., J. Biol. Chem. 

30 265:6104-11 (1990); NCBI Accession No. AL 109933. 25 which disclosures are hereby 

incorporated by reference in their entirety). Splicing occurs normally between transcribed exons 
XV to XDC Translation initiates within exon XV and is carried out in the plasminogen open 
reading frame. Plasminute represents the carboxyl-terminal fragment of plasminogen 
corresponding to amino acids 585 to 790 (numbered from the amino-terminal glutamic acid residue 

35 of secreted plasminogen). 

Importantly, Plasminute is a variant of plasmin distinguished by the novel manner in which 
its protease activity escapes regulation. Plasminute retains the catalytic triad of plasmin (His603, 
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Asp646, Ser741, numbered from the arnino-terminal glutamic acid residue of secreted 
plasminogen). Plasminute manifests constitutive protease activity, circumventing the requirement 
for proteolytic activation by virtue of its translation initiating downstream of the cleavage site 
involved in the conversion to plasmin from plasminogen (amino acids 561-562 of secreted 
5 plasminogen, numbered from the amino terminal glutamic acid residue). In addition, the protease 
activity of Plasminute is relatively resistant to inhibition by the primary plasmin inhibitor, alpha-2- 
antiplasmin, by virtue of its translation initiating downstream of the plasminogen kringle domains. 

In a preferred embodiment, the present invention provides for a method of contacting 
Plasminute with a blood clot in patients with acute vascular disease. The advantage of Plasminute 

10 over plasminogen activators is two-fold: 1) it circumvents the necessity to generate plasmin. within 
the patient and therefore is more direct and controllable; and 2) it is not immediately neutralized by 
excess alpha-2-antiplasmin, as is the case for most of the plasmin generated through exogenously 
administered activator (US Patent 5,753,486; "Human tissue plasminogen activator;" which 
disclosure is hereby incorporated by reference in its entirety). Preferred compositions comprise 

15 Plasminute. Preferred mode of admistration is intravenous injection. 

In further preferred embodiment, the present invention provides for a method of contacting 
Plasminute with a blood clot in patients with diseases having an etiological basis pointing to either 
a partial or, in severe cases, total occlusion of a blood vessel by a blood clot — thrombus or 
thromboembolus. Further preferred is a method of contacting Plasminute with a blood clot in said 

20 patients for the purpose of dissolving said clot. Further preferred are compositions comprised of 
Plasminute used in methods of contacting a blood clot with an ameliorative effective amount in 
patients with acute vascular disease wherein the acute vascular disease is selected from, but not 
restricted to, the group consisting of: (a) Myocardial infarct; (b) Stroke; (c) Pulmonary embolism; 
(d) Deep vein thrombosis; (e) Peripheral arterial occlusion; and (f) Other venous thromboses. 

25 Plasmin plays an important role in wound healing, including recovery from myocardial 

infarction, skin wounds, and arterial neointima formation. In the course of myocardial infarction, 
cardiomyocytes die and a process that resembles wound healing in, for instance, skin wounds and 
requiring plasmin occurs (Creemers E, et al., Am. J. Pathol. 156:1865-73 (2000) which disclosure 
is hereby incorporated by reference in its entirety). Specifically with respect to skin wounds, 

30 plasmin is required for the efficient keratinocyte migration necessary for wound closure (Romer J 
. et al., Nat. Med. 2:287-92 (1996) which disclosure is incorporated by reference in its entirety). 
With respect to arterial neointima formation, plasmin is required for migration of smooth muscle 
cells into the necrotic center of the induced arterial wall injury (Carmeliet, P et al., J. Clin. Invest. 
99:200-8 (1997) which disclosure is incorporated by reference in its entirety). 

35 hi further embodiment, the present invention provides for compositions comprised of 

Plasminute used in methods of promoting wound healing. Further preferred are compositions 
comprised of Plasminute used in methods of contacting said wound with an ameliorative effective 
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amount wherein the wound is selected from, but not restricted to, the group consisting of: 
(a)Myocardial infarction; (b)Skin wound; and (c)Arterial wall injury. 

The compositions and methods for treatment of acute vascular disease and wound healing 
discussed above are not limited to use in humans, but can have veterinary applications as well. 
5 Partial digestion of a protein by plasmin is frequently exploited in in vitro biochemical 

analysis of said protein (Bewley, TA, Biochemistry 16:209-15 (1977); Nawratil, P et al., J. Biol. 
Chem. 271:31735-41 (1996); Kost, C et al., Eur. J. Biochem. 236:682-8 (1996); Angelloz-Nicoud, 
P et al., Growth Hormone and IGF Research 8:71-75 (1998); Itoh, Y et al., J. Biochem. 128:1017- 
24 (2000); which disclosures are hereby incorporated by reference in their entirety). For example, 

10 partial digestion by plasmin can be useful in assigning function to specific protein domains and in 
mapping antigenic epitopes onto the protein. Plasminute has utility over plasmin for said 
biochemical analysis in that: 1) production of Plasminute does not require proteolytic activation of 
plasminogen; and 2) the smaller size of Plasminute makes it easier to manipulate. 

Further preferred are compositions comprised of Plasminute used in methods of in vitro 

15 biochemical analysis of protein, including but not restricted to the analysis of protein function and 
antigenicity. Further preferred are compositions comprised of Plasminute used as part of a kit in 
methods of in vitro biochemical analysis of protein, including but not restricted to the analysis of 
protein function and antigenicity. 

In a preferred embodiment, the present invention provides for an antibody that specifically 

20 binds Plasminute of the present invention. Further preferred is a method of making said antibody 
wherein said antibody recognizes a non-conformational or conformational epitope of Plasminute. 
Further preferred is a method of making said antibody wherein said antibody neutralizes the serine 
protease activity of Plasminute or facilitates the elimination of Plasminute from tissue. 

Further preferred is a method wherein a mouse is immunized with Plasminute. Further 

25 preferred is a method wherein monoclonal antibodies from said mouse are screened for binding to 
Plasminute but not to plasmin or plasminogen. Further preferred is a method wherein monoclonal 
antibodies derived from said mouse are screened by enzyme-linked immunosorbent assay (ELISA) 
for binding to Plasminute but not to plasmin or plasminogen. Further preferred is a method 
wherein monoclonal antibodies from said mouse are screened for binding to Plasminute but not to 

30 plasmin, plasminogen, miniplasmin, miniplasminogen, microplasmin, or microplasminogen. 
Further preferred is a method wherein monoclonal antibodies derived from said mouse are 
screened by ELISA for binding to Plasminute but not to plasmin, plasminogen, miniplasmin, 
miniplasminogen, microplasmin, or microplasminogen. Further preferred is a method wherein said 
antibody is screened for the capacity to sterically or allosterically neutralize the serine protease 

3 5 activity of Plasminute. Further preferred is a method of humanizing said monoclonal antibody. 
Methods of generating said monoclonal antibody and of establishing specificity by methods 
including ELISA are well known to those skilled in the art. Methods of screening said antibody to 
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neutralize the serine protease activity of Plasminute are well known to those skilled in the art and 
include, but are not limited to: contacting the antibody with Plasminute, incubating the antibody- 
Plasminute complex with a substrate of Plasminute, and following proteolytic activation of fee 
Plasminute substrate. Methods of humanizing said monoclonal antibody are well known to those 
5 skilled in the art. 

The functionality of Plasminute is proinflammatory. Functional fragments of plasminogen 
at least as small as mirriplasminogen have been observed in synovial fluid in acute inflammatory 
arthritis but not in synovial fluid in acute non-inflammatory arthritis (Moroz, LA et al., Thrombosis 
Research 43:417-24 (1986) which disclosure is hereby incorporated by reference in its entirety). 

10 In a preferred embodiment, the present invention provides for a method of contacting said 

antibody and specifically binding it with Plasminute, Further preferred is a method for using said 
antibody diagnostically to determine the basis for inflammopathology. Further preferred is a 
method for using said antibody diagnostically in a sandwich ELIS A format to determine the level 
of Plasminute in plasma or other bodily fluid, including but not restricted to synovial fluid and 

15 cerebrospinal fluid, within a pathological context Further preferred is a method for using said 
antibody in a sandwich ELIS A format to determine the level of Plasminute in plasma or other 
bodily fluid, including but not restricted to synovial fluid and cerebrospinal fluid, from normal 
subjects in order to establish a baseline level of Plasminute. Further preferred is a method of using 
said diagnostic assay to determine the level of Plasminute in plasma or other bodily fluid of a 

20 patient with inflammopathology wherein the inflammopathology is selected from, but not 
restricted to, the group consisting of: (a) Atheriosclerosis; (b) Inflammatory bowel disease; 
(c) Insuline dependent diabetes mellitus (Type 1 diabetes); (d) Systemic lupus erythematosus; 
(e) Multiple sclerosis;Psoriasis; (f) Allergic asthma; (g) Septic shock; and (h) Reperfusion injury. 
In a preferred embodiment, the present invention provides for a method of contacting said 

25 antibody and specifically binding it with Plasminute. Further preferred is a method for using said 
antibody diagnostically to determine the basis for inflammatory arthritis. Further preferred is a 
method of using said diagnostic assay to determine the level of Plasminute in synovial fluid of a 
patient with acute inflammatory arthritis (a-d below) or acute non-inflammatory arthritis (e-f 
below). Plasminute level may be additionally useful is distinguishing the former from the latter 

30 (Moroz, LA et al., Thrombosis Research 43:417-24 (1986) which disclosure is hereby incorporated 
by reference in its entirety). 

In a preferred embodiment, the present invention provides for a method of contacting said 
antibody and specifically binding it with Plasminute. Further preferred is a method of using said 
diagnostic assay in said sandwich ELIS A format to determine the level of Plasminute in synovial 

35 fluid of a patient with acute inflammatory arthritis or acute non-inflammatory arthritis. Further 
preferred is a method of using said diagnostic assay to determine the level of Plasminute in 
synovial fluid of a patient with acute inflammatory arthritis or acute non-inflammatory arthritis 
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wherein the arthritis is selected from, but not restricted to, the group consisting of: (a) Rheumatoid 
arthritis; (b) Gout; (c) Septic arthritis; (d) Reiter's syndrome; (e) Osteoarthritis; and (f) Trauma. 

In a preferred embodiment, the present invention provides for a method of contacting said 
antibody and specifically binding it with Plasminute. Further preferred is a method of using said 

5 antibody diagnostically in an immunohistochemistry format to determine the level of Plasminute in 
affected tissue in a patient presenting with inflammopathology. Further preferred is a method of 
using said antibody diagnostically in an immunohistochemistry format to determine the level of 
Plasminute in affected tissue in a patient presenting with inflammopathology wherein said 
inflammopathology is selected from, but not restricted to, the group consisting of: 

10 (a) Inflammatory arthritis; (b) Atheriosclerosis; (c) Inflammatory bowel disease; (d) Insuline 
dependent diabetes mellitus (Type 1 diabetes); (e) Systemic lupus erythematosus; (f) Multiple 
sclerosis; (g) Psoriasis; (h) Allergic asthma; (i) Septic shock; and (j) Reperfusion injury. 

The components of the urokinase plasminogen activator system involved in conversion of 
plasminogen to plasmin are present in significantly higher amounts in malignant tumors than in 

15 normal tissue or benign tumors, and said elevated expression is related to poor prognosis for a 
.variety of patients diagnosed with tumors including breast, prostate, lung, or colon cancer 
. (Andreasen, PA et al., Cell. Mol. Life Sci. 57:25-40 (2000) which disclosure is hereby 

incorporated by reference in its entirety) (discussed in more detail below). The largest data sets 
correlating urokinase plasminogen activator level with patient prognosis are available for breast 

20 cancer. In the Western world, about one in every ten women will develop breast cancer. In a 
significant number of these patients, metastatic cells will have spread to the lymph nodes and other 
tissues by the time their breast tumor is diagnosed. Therefore, following surgery these patients will 
normally receive some kind of additional therapy aimed at reducing their risk of developing 
secondary cancer. 

25 Even for those patients whose lymph nodes are free of tumor cells (node negative), it is 

still important to know whether they are at high or low risk of developing secondary tumors. 
Measuring the levels of the components of the urokinase plasminogen activator system can assess 
this risk, high levels indicating a high risk of developing metastases and suggesting that patients 
should be treated with additional therapy. Just as elevated plasmin generation on engagement of 

30 the urokinase plasminogen activator system indicates high risk for metastases, elevated Plasminute 
expression by the tumor cells indicates high risk for metastases. In a preferred embodiment, the 
present invention provides for a method of contacting said antibody and specifically binding it with 
Plasminute. Further preferred is a method of using said antibody diagnostically in an 
immunohistochemistry format to determine the level of Plasminute in affected tissue in a patient 

35 presenting with cancer. Further preferred is a method of using said antibody diagnostically in an 
immunohistochemistry format to determine the level of Plasminute expressed by tumor cells in a 
patient presenting with cancer wherein said cancer is selected from, but not restricted to, the group 
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consisting of: (a) Melanoma; (b) Squamous cell carcinoma of the skin; (c) Breast carcinoma; 
(d) Lung small-cell carcinoma; (e) Colon carcinoma; (f) Hodgkin's lymphoma; (g) Non^-Hodgkin's 
lymphoma; (h) Prostatic carcinoma; (i) Pancreatic carcinoma; Q) Osteosarcoma; (k) Uterine 
carcinoma; (1) Ovarian carcinoma; (m) Chondrosarcoma; (n) Endometrial cancer; (o) Testicular 
5 carcinoma; (p) Renal carcinoma; (q) Hepatic carcinoma; (r) Lung non-small-cell carcinoma; (s) T 
lymphocyte acute lymphoblastic leukemia (T-ALL); (t) B lymphocyte acute lymphoblastic 
leukemia (B-ALL); (u) Acute myeloid leukemia (AML); (v) Chronic lymphocytic leukemia 
(CLL); and (w) Multiple myeloma. 

Viral hemorrhagic fevers are a group of diseases caused by viruses from four distinct 

10 families: filoviruses, arenaviruses, flaviviruses, and bunyaviruses. Virus driven expression of host 
Plasminute and a consequential hyperfibrraolysis may be contributory to at least some said viral 
pathologies. Furthermore, measuring Plasminute level may have diagnostic value in distinguishing 
between viruses in this group or have diagnostic value in distinguishing viruses belonging to this 
group from viruses not belonging to this group. 

15 In a preferred embodiment, the present invention provides for a method of contacting said 

antibody and specifically binding it with Plasminute. Further preferred is a method of using said 
diagnostic assay in sandwich ELISA format to determine the level of Plasminute in plasma or other 
bodily fluid in a patient suspected of having viral hemorrhagic fever. Further preferred is a method 
for using said diagnostic assay to determine the level of Plasminute in a patient infected by a virus 

20 not belonging to the group causing viral hemorrhagic fever, in order to establish a baseline level of 
Plasminute level in said other viral infection. Further preferred is a method of using said 
diagnostic assay to determine the level of Plasminute in plasma or other bodily fluid in patient 
suspected of having viral hemorrhagic fever when said virus is selected from, but is not restricted 
to, the group consisting of: (a) Ebola virus; (b) Omsk hemorrhagic fever virus; (c) Junin virus; 

25 (d) Marburg virus; (e) Crimean-Congo hemorrhagic fever virus; and (f) Dengue fever virus. 

The serine protease activity of Plasminute is relatively resistant to inhibition by the 
. primary plasmin inhibitor, alpha-2-antiplasmin, by virtue of its translation initiating downstream of 
the plasminogen kringle domains. In in vitro analysis of clinical samples, it is important to prevent 
artifactual proteolysis of the sample ex vivo, including that by plasmin or a derivative thereof. If 

30 the sample contains Plasminute, the current art of using alpha-2-antiplasmin to block said 
proteolysis would be inadequate. In this context, said antibody directed to Plasminute and 
neutralizing its serine protease activity would have utility In a preferred • 

embodiment, the present invention provides for a method of contacting said antibody and 
specifically binding it with Plasminute. Further preferred is a method of using said neutralizing 

35 anti-Plasminute antibody to block ex vivo proteolysis by Plasminute within clinical samples. 

There is precedent for the expression of two alternatively spliced transcripts derived from 
the same gene and encoding functionally distinct protein isoforms being reciprocally modulated by 
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cytokine. This is the case for monocyte expression of CD86, a T lymphocyte co-stimulator 
molecule, for example. Interferon gamma down-regulates monocyte expression of the 
alternatively spliced transcript encoding a truncated and interfering version of CD86 and up- 
regulates the spliced transcript encoding full-length CD86 (Magistrelli, G et al., Biochem. Biophys. 

5 Res. Commun. 280: 121 1-5 (2001) which disclosure is hereby incorporated by reference in its 
entirety). It is not unreasonable to expect, therefore, that there may be cytokine modulation of 
transcription initiation within a gene leading to alternative transcripts encoding functionally 
distinct protein isoforms, as is the case for the present invention. Identification of said cytokine 
regulation of alternative transcription initiation within a gene would be expected to have 

10 therapeutic value and to lead to a better understanding of disease pathology. 

In a preferred embodiment, the present invention provides for a method of contacting said 
antibody and specifically binding it with Plasminute. Further preferred is a method of using said 
antibody to characterize cytokine regulation of Plasminute expression by endothelial cells. Further 
preferred is a method of using said antibody in said sandwich ELISA to characterize cytokine 

15 regulation of Plasminute expression by endothelial cells. Further preferred is a method of using 
said antibody in said sandwich ELISA to characterize cytokine regulation of Plasminute expression 
by endothelial cells wherein the cytokine is selected from, but not restricted to, the group 
consisting of: (a) Interferon gamma; (b) Interleukin 17; (c) Interleukin 4; (d) Interleukin 10; 
(e) Interleukin 13; (f) Interleukin 15; (g) Interleukin 12; (h) Interleukin 18; (i) Interleukin 20; 

20 (j) Interleukin 21 ; (k) Interleukin 1 beta; (1) Merleukin 6; (m) Monocyte chemotactic protein 1 
(MCP-1); (n) RANTES; (o) IP-10; (p) Vascular endothelial growth factor (VEGF); 
(q) Transforming growth factor beta; (r) Interleukin 8; and (s) Tumor necrosis factor alpha. 

Methods of characterizing cytokine regulation of Plasminute expression by endothelial 
cells are well known to those skilled in the art and include, but are not limited to: incubation of 

25 endothelial cells with or without cytokine for 24-48 hours, collection of culture supernatant, and 
determination of Plasminute protein in the culture supernatant by sandwich ELISA. 

The transcript encoding Plasminute can be readily distinguished from that encoding 
plasminogen and its derivatives. Further preferred therefore is a method of directly characterizing 
cytokine regulation of Plasminute mRNA expression by endothelial cells. Further preferred is a 

30 method of using polynucleotide comprising Plasminute to determine the level of Plasminute 
mRNA in endothelial cells. Further preferred is a method of using polynucleotide comprising 
Plasminute to determine the level of Plasminute mRNA in endothelial cells that have been 
incubated in the presence or absence of cytokine for 0, 2, 4, 6, 8, 12, or 24 hours. Further preferred 
is a method of using a Plasminute cDNA fragment encoding 5 '-untranslated sequence derived from 

35 intron N as a specific probe in Northern blot analysis of said Plasminute mRNA level. Further 
preferred is a method of using a primer specified in Plasminute 5 '-untranslated sequence derived 
from intron N in conjunction with a primer specified in Plasminute 3 '-untranslated sequence to 
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specifically determine said Plasminute mRNA level by reverse transcriptase-polymerase chain 
reaction (RT-PCR). Methods of carrying out Northern blot analysis or RT-PCR on total or 
poly(A)+ RNA are well known to those in the art. 

The functionality of Plasminute is proinflammatory. In this context, it is significant that 
5 functional fragments of plasminogen at least as small as miniplasminogen have been observed in 
synovial fluid in acute inflammatory arthritis but not in synovial fluid in acute non-inflammatory 
arthritis (Moroz, LA et al., Thrombosis Research 43:417-24 (1986) which disclosure is hereby 
incorporated by reference in its entirety). Said neutralizing anti-Plasminute antibody would be 
expected to have therapeutic value in inflammopathologies in which Plasminute plays a role. 

10 In its capacity as a serine protease, plasmin plays a role in normal processes involving cell 

migration in tissue remodeling. In this regard, plasmin is believed to function in processes in 
which cell movement is essential, such as macrophage invasion in inflammation and angiogenesis.. 
Involvement of plasmin in these processes is supported by the ability of plasmin to degrade 
extracellular matrix proteins directly, such as proteoglycans, fibronectin, laminin, and type IV 

15 collagen, and/or be indirectly responsible for the degradation of matrix proteins through activation 
of metalloprotease zymogens, such as stromolysin and procollagenase. As a result of degradation 
of the extracellular matrix, cell migration into surrounding areas becomes more facile (Castellino, 
FJ in Molecular Basis of Thrombosis and Hemostasis, High, KA & Roberts, HR, editors, New 
York, pp 495-515 (1995) which disclosure is hereby incorporated by reference in its entirety). 

20 Neovascularization plays a role in a number of diseases, including but not limited to 

rheumatoid arthritis (Danis, RP et al., Expert Opin. Pharmacother. 2:395-407 (2001) which 
disclosure is hereby incorporated by reference in its entirety). 

In a further preferred embodiment, the present invention provides for a method of 
contacting and specifically binding to Plasminute said antibody having the capacity to neutralize 

25 the serine protease activity of Plasminute or to facilitate the elimination of Plasminute from tissue. 
Further preferred is a method of using said antibody in contact with Plasminute as a therapeutic for 
patients with inflammopathology. Preferred compositions comprise said Plasminute antibody or 
fragments or derivatives thereof. Preferred formulation of said composition is that compatible with 
the route of delivery wherein said route of delivery is selected from, but not restricted to the group 

30 consisting of: (a) Oral; (b) Transdermal; (c) Injection; (d) Buccal; and (e) Aerosol. 

In further preferred embodiment, the present invention provides for a method of contacting 
and specifically binding to Plasminute said antibody having the capacity to neutralize the serine 
protease activity of Plasminute or to facilitate the elimination of Plasminute from tissue. Further 
preferred is a method of using said Plasminute antibody to treat patients with inflammopathology. 

35 Further preferred is a method of using said composition comprised of said Plasminute antibody to 
ameliorate the symptoms or pathology associated with said inflammopathology. Said Plasminute 
antibody ameliorates the symptoms or pathology associated with said inflammopathology by 
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blocking the proteolytic remodeling of matrix that is directly or indirectly mediated by Plasminute 
and that facilitates the inflammatory process, including macrophage invasion, or angiogenesis that 
is associated with the pathology. Further preferred is a method of delivering to patients with said 
inflammopathology an ameliorative effective amount of said Plasminute antibody wherein said 
5 inflammopathology is selected from, but not restricted to, the group consisting of: (a) Rheumatoid 
arthritis; (b) Atheriosclerosis; (c) Inflammatory bowel disease; (d) Insulin dependent diabetes 
mellitus (Type 1 diabetes); (e) Systemic lupus erythematosus; (f) Multiple sclerosis; (g) Psoriasis; 
(h) Allergic asthma; (i) Septic shock; and (j) Reperfusion injury. 

Proliferative diabetic retinopathy (PDR) remains one of the major causes of aquired 
. 10 blindness in developed nations. The hallmark of PDR is neovascularization, abnormal 

angiogenesis that may ultimately cause severe vitreous cavity bleeding and/or retinal detachment. 
In a further embodiment of the invention, said composition comprised of neutralizing anti- 
Plasminute antibody is used in a method to treat patients with said PDR. 

In its capacity as a serine protease, plasmin plays a role in pathological processes of cell 

15 migration that are involved in tumor cell growth and invasion of surrounding tissue and, perhaps, 
metastases [Andreasen, PA et al., Cell. Mol. Life Sci. 57:25-40 (2000), which disclosure is hereby 
incorporated by reference in its entirety]. Involvement of plasmin in these processes is supported 
by the ability of plasmin to degrade extracellular matrix proteins directly, such as proteoglycans, 
fibronectin, laminin, and type IV collagen, and/or be indirectly responsible for the degradation of 

20 matrix proteins through activation of metalloprotease zymogens, such as stromolysin and 
procollagenase. As a result of degradation of the extracellular matrix, cell migration into 
surrounding areas becomes more facile (Castellino, FJ in Molecular Basis of Thrombosis and 
Hemostasis, High, KA & Roberts, HR, editors, New York, pp 495-515 (1995) which disclosure is 
hereby incorporated by reference in its entirety). 

25 The urokinase plasminogen activator system, and by implication plasmin, is associated 

with high risk of tumor invasiveness and metastates pConno, H et al., Jpn. J. Cancer Res. 92:5 16- 
23 (2001); Fisher, JL et al., Clin. Cancer Res. 7:1654-60 (2001); Vazquez-Rivera, F et al., 
Proceedings of the 1 1th NCI-EORTC-AACR Symposium, Abstract 294 (2000); Ellrieder, V et al., 
Annals of Oncology 10, suppl.4, 41-45 (1999); Smolarz, B et al., Med. Sci. Monit. 5:833-7 (1999); 

30 Romer, J et al., J. Invest. Dermatol. 1 16:353-8 (2001); Abe, J et al., Cancer 86:2602-1 1 (1999); 
Morii, T et al., Anticancer Res. 20(5A):3031-6 (2000); Tecimer, C et al., Gynecol. Oncol. 80:48-55 
(2001); Zheng, Q et al., J. Cancer Res. Clin. Oncol. 126:641-6 (2000); Swiercz, R et al. Oncol. 
Rep. 8:463-70 (2001); Borgfeldt, C et al. Int. J. Cancer 92:497-502 (2001); He, C et al., J. Cancer 
Res. Clin. Oncol. 127:180-6 (2001); which disclosures are hereby incorporated by reference in 

35 their entirety]. 

In further preferred embodiment, the present invention provides for a method of contacting 
and specifically binding to Plasminute said antibody having the capacity to neutralize the serine 
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protease activity of Plasminute or to facilitate the elimination of Plasminute from tissue. Further 
preferred is a method of using said Plasminute antibody to treat patients with cancer. Further 
preferred is a method of using said composition comprised of said Plasminute antibody to 
ameliorate the symptoms or pathology associated with said cancer. Said Plasminute antibody 
5 ameliorates the symptoms or pathology associated with said cancer by blocking the proteolytic 
remodeling of matrix that is directly or indirectly mediated by Plasminute and that facilitates the 
invasive and metastatic processes or angiogenesis that is associated with the pathology. Further 
preferred is a method of delivering said composition comprised of said Plasminute antibody by 
intravenous injection. Further preferred is a method of delivering to patients with said cancer an 

10 ameliorative effective amount of said Plasminute antibody wherein said cancer is selected from, 
but not restricted to, the group consisting of: (a) Melanoma; (b) Squamous cell carcinoma of the 
skin; (c) Breast carcinoma; (d) Lung small-cell carcinoma; (e) Colon carcinoma; (f) Hodgkin's 
lymphoma; (g) Non-Hodgkin's lymphoma; (h) Prostatic carcinoma; (i) Pancreatic carcinoma; 
(j) Osteosarcoma; (k) Uterine carcinoma; (m) Ovarian carcinoma; (n) Chondrosarcoma; 

15 (o) Endometrial cancer; (p) Testicular carcinoma; (q) Renal carcinoma; (r) Hepatic carcinoma; 
(s) Lung non-small-cell carcinoma; (t) T lymphocyte acute lymphoblastic leukemia (T-ALL); (u) B 
lymphocyte acute lymphoblastic leukemia (B-ALL); (v) Acute myeloid leukemia (AML); 
(w) Chronic lymphocytic leukemia (CLL); and (x) Multiple myeloma. 

In a further preferred embodiment, the present invention provides for a method of 

20 screening test compounds for the ability to bind Plasminute and specifically neutralize the serine 
protease activity of Plasminute. Further preferred are said test compounds that bind to either a 
non-conformational or conformational site on Plasminute. Further preferred are test compounds 
that neutralize said serine protease activity of Plasminute either sterically or allosterically. Further 
preferred is a method of screening said test compounds for the capacity to neutralize said serine 

25 protease activity of Plasminute. Methods of screening said test compounds for the capacity to 
neutralize said serine protease activity of Plasminute are well known to those skilled in the art and 
include, but are not limited to: contacting the test compound with Plasminute, incubating the test 
compound-Plasminute complex with a substrate of Plasminute, and following proteolytic 
activation of the Plasminute substrate. 

30 Preferred formulations of said compound are those selected from, but not restricted to, the 

group consisting of: (a) Oral; (b) Transdermal; (c) Injection; (d) Buccal; and (e) Aerosol. 

Said compounds found to bind to and specifically neutralize the serine protease activity of 
Plasminute are used in methods analogous to those described above for neutralizing anti- 
Plasminute antibody. 

35 Protein of SEQ ID NO:56 (internal designation Clone 519757_184-4-2-0-F7-F) 

The cDNA of clone 51 9757J 84-4-2-0-F7-F (SEQ ID NO:55) encodes the human 
intracellular signaling protein comprising the amino acid sequence: 
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MLEVSDALGGPGRWGATAGMNGVDTSLLC^ 

YKEATLTMDQVSSIPALRVNPFRDRICRWSHKGM^ 

FRIYDFNENGFIDEEDLQRIILRLLNSDDMSEDLLMDLT^ 

MAKSPDFMNSFRIHFWGC and shares features with the Calcium and Integrin-Binding (CIB)- 

5 and the DNA-dependent kinase interacting (KIP) protein. It will be appreciated that all 
characteristics and uses of the polynucleotides of SEQ ID NO:55 and polypeptides of SEQ ID 
NO:56, described throughout the present application also pertain to the human cDNA of clone 
5 19757_1 84-4-2-0-F7-F and polypeptide fragments encoded thereby. Polypeptide fragments 
having a biological activity described herein and polynucleotides encoding the same are included 

10 in the present invention. Related polypeptide sequences included in the present invention are 
MGQCLRYQMH 
WEDLEEYQALTFLTRNEILCIH^ 
CRWSHKGMFSFEDVLGMASWSEQACT 
NSDDMSEDLLMDLTNHVLSE 

15 The gene of SEQ ID: 55 is located on chromosome 2, is ubiquitously expressed, has two 

EF-hand calcium- and zinc-binding domains, regulates Ca 2+ -dependent dephosphorylation 
processes such as neuronal transmission, muscle glycogen metabolism, and lymphocyte activation 
and is hereby referred to as CALSIGN. CALSIGN stimulates signaling processes which lead to 
platelet aggregation and blood clot formation. It binds to the cytoplasmic domain of integrins and 

20 regulates integrin function in physiological processes via the fibrinogen receptor (integrin a^Ps), 
which is expressed on platelets, and thereby activates integrin for binding to fibrinogen, 
fibronectin, the von-Willebrand factor, vitronectin, and thrombospondin; it also binds to the 
interferon 1 -receptor and contributes to signal transduction events in platelets, which lead to strong 
cell-cell adhesion, platelet aggregation, and blood clot formation. CALSIGN also facilitates 

25 immune responses via restoration of surface antigen expression and T-cell activation in response to 
viral- and bacterial infections and to endogenous factors. Further characteristics of CALSIGN 
comprise VDJ-recombination in B-cell maturation and surface antigen expression on mature B- 
cells [Naik et al, JSiol.Chem.272:4651-4654, 1997; PCT WO 98/14471, 1998; Wu and Lieber, 
MutatRes.385:13-20, 1997; PCT WO 98/31796, 1998; Hynes, Cell 69:1 1-25, 1992; Smyth et al., 

30 Blood 81:2827-2843, 1993; US Patent 6,093,565, 2000, which references are hereby incorporated 
in their entirety]. 

In a preferred embodiment, CALSIGN or other polypeptides of the invention are used in a 
method for tissue regeneration and wound healing after injuries. Wounds, in particular those 
occurring in the skin as second and third degree burns, stasis ulcers, trophic lesions such as 
35 decubitus ulcers, severe cuts and abrasions, which are commonly resistant to natural healing 
processes, may be treated with a composition comprising CALSIGN or other polypeptides 
included in the invention, or fragments thereof, in a formulation, which might include a growth 
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factor such as platelet derived growth factor (PDGF) or connective tissue growth factor (CTGF), 
or a wound dressing with aseptic properties such as silver-coated fibers (US Patent 6,149,916, 
2000; US patent 6,187,743, 2001; US Patent 6,087,549, 2000), which references are hereby 
incorporated in their entirety. 

5 The process of wound healing consists of three phases during which the injured tissue is 

repaired, regenerated, and new tissue is reorganized into a scar. These three phases are classified 
as: a) an inflammation phase which begins from day 0 to 3 days, b) a cellular proliferation phase 
from 3 to 12 days, and c) a remodeling phase from 3 days to about 6 months. In all three phases, 
antioxidants play a vital role in the healing process. In the inflammation phase, inflammatory 

1 0 cells, mostly neutrophils, enter the site of the wound followed by lymphocytes, monocytes, and 
later macrophages. The neutrophils that are stimulated begin to release proteases and reactive 
oxygen species into the surrounding medium with potential adverse effects on both the adjacent 
tissues and the invading microorganisms. The oxygen species known to be released by the 
neutrophils are superoxide (0.sub.2.sup.-) through the action of a plasma membrane-bound 

1 5 NADPH oxidase, hydrogen peroxide (H.sub.2 O.sub.2) formed by action of dismutation of 
0.sub.2.sup.-, and HOC1 produced by the action of myeloperoxidase with H.sub.2 O.sub.2. 

The proliferative phase consists of laying down new granulation tissue, and the formation 
of new blood vessels in the injured area. The fibroblasts, endothelial cells, and epithelial cells 
migrate in the wound site. These fibroblasts produce the collagen that is necessary for wound 
. 20 repair. Ascorbic acid is crucial in the formation of collagen. Several studies have demonstrated 
that ascorbic acid was capable of overcoming the reduced proliferative capacity of elderly dermal 
fibroblasts, as well as increasing collagen synthesis in elderly cells by similar degrees as in 
newborn cells even though the basal levels of collagen synthesis are age dependent A decrease of 
ascorbic acid at the injury area will decrease the rate of wound healing. In reepithelialization, 

25 epithelial cells migrate from the free edges of the tissue across the wound. This event is succeeded 
by the proliferation of epithelial cells at the periphery of the wound. Research has also shown that 
reepithelialization is enhanced by the presence of occlusive wound dressings which maintain a 
moisture barrer. The final phase of wound healing, which is remodeling, is effected by both the 
replacement of granulation tissue with collagen and elastin fibers and the devascularization of the 

30 granulation tissue. Recent studies have shown that topical application of antioxidants, especially 
alpha-tocopherol, reduces scarring and normalizes blood coagulation during therapy. 

A particularly effective healing treatment for wounds and skin defects such as bums, ulcers 
and lesions is the application of a medicinal dressing containing as an essential ingredient starch 
hydrolysate having Dextrose Equivalent of less than about 35 . In such wound treatment the starch 

35 hydrolysate produces the formation of a film which is intimately adhered to the underlying 

granulation tissue and which is semi-permeable to gas and fluids and provides an ideal protective 
cover that will reduce fluid and plasma losses and invasion by pathogenic bacteria. In addition, it 
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appears that the starch hydrolysate provides a topical or local hyperalimentation, that is local 
nutrition, providing a gradual release of glucose which is particularly effective in nutrition of 
tissue, both damaged and nascent, which have become relatively isolated from normal blood flow 
nutrition. The cessation of blood flow to such an ischemic lesion can be developed in a slow and 
5 gradual form such as in the case of decubitus ulcers and stasis ulcers, or may take place more 
acutely such as in thermo-radiation and chemical burns. In the absence of nutrition, the rate of 
fluid delivery of nutrients decreases bringing a progressive impairment in the viability of cells and 
tissues. This eventually leads to degeneration and death of the tissue and cells in a condition 
known as necrosis. Necrosis is generally accompanied by bacterial, fungal and/or viral 

1 0 contamination. As further pointed out in the aforementioned patent, treatment of exudative skin 
wounds with a starch hydrolysate dressing produces a greatly reduced bacteria count of an infected 
wound and inhibits infection of an uninfected wound. In addition, application of the starch 
hydrolysate to a wound or ulcer produces a film or semi-permeable membrane which allows 
edematous liquid to pass through while proteinaceous material is retained within the body, 

1 5 allowing reduction in the volume of exudate in relatively clean condition. 

Compositions which enhance and promote the wound healing process comprise 
suspensions of CALSIGN, said fibrous protein, collagen, and a polysaccharide such as a 
glycosaminoglycan, which exhibits chemotaxis for fibroblasts or endothelial cells; the preferred 
glycosaminoglycans are said to be heparin, heparan sulfate, or alginate; collagen type I, vitamins 

20 .such as ascorbic acid (vitamin C) and alpha-tocopherol (vitamin E), and particulate starch 

hydrolysate are applied on wounds to promote the formation and growth of healthy granulation 
tissue. Wound healing processes will be significantly improved by multilayer laminate "wound 
dressings "comprising alternate layers of silver or silver-coated fibers and non-metalized fibers, 
which promote cellular proliferation and comprise antibacterial, antifungal, and analgesic 

25 properties (US Patent 6,087,549, 2000). The repair process for even minor breaches or ruptures 
takes a period of time extending from hours and days to weeks; and in some instances, as in 
ulceration, the breach or rupture may persist for extended periods of time, i.e., months or even 
years. At all times, be it brief or extended, the potential for invasion by pathogenic organisms or 
foreign substances continues until new tissue has been generated to fully close the rupture or 

30 breach. Becauseof the danger of infections, the customary management of wounds includes an 
initial thorough cleansing of the affected area to remove any contaminants such as dirt, cloth 
particles, or other debris that may introduce pathogenic materials. Any hopelessly damaged tissues 
may be debrided and antiseptic materials are applied to make the area as sterile as possible. If 
considered necessary, sutures may be used to reduce the area of the underlying tissues and thereby 

35 limit the amount of tissue exposed to subsequent contamination. The healing process is brought 
about by complex biological mechanisms generally involving several groups of special cells and 
proteins. Leukocytes, such as neutrophils and macrophages, crown the wound site and digest 
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foreign pathogens and debris. Such cells also send out chemical signals that marshal fibroblasts in 
the wound vicinity and ultimately generate connective structures, principally, collagen, which 
make up a major portion of the new tissues. Endothelial cells generate new blood capillaries that 
grow into the reconstructed tissue areas where their presence is necessary to supply nutrients to the 
5 newly growing tissue cells and remove catabolic products. As the new capillaries grow, the cells 
on the margin of the wound simultaneously multiply and grow inwardly. The fibrous tissue arising 
from this cell growth eventually fills the wound cavity with a network of interlacing threads of 
collagen which in due time, arrange themselves in firm bands and form the permanent new tissue. 
Said method for promoting wound healing comprises the steps of: 
1 0 Applying to the wound a composition of a therapeutically effective concentration of 

CALSIGN or other polypeptides included in the invention in an aqueous suspension with bovine 
collagen type I and aipha-tocopherol in a mixture with starch hydrolysate of a low dextrose 
equivalent DE, wherein said composition is chemotactic for fibroblasts and endothelial cells. Said 
■ bovine collagen is pre-treated to remove extraneous proteinaceous material by various dissolution, 
1 5 precipitation and filtration techniques to provide pure collagenous product. 

The composition may be combined with a combination of vitamins such as vitamin C and 
vitamin E, and with a therapeutically effective concentration of a purified connective tissue growth 
factor (CTGF), and platelet derived growth factor (PDGF). 

Said aquous suspension is applied repeatedly to the wound during the healing to effectively 
20 promote the healing process 

Said aqueous suspension may be combined with said multilaminate silver dressing for the 
treatment of postoperative wounds 

These embodiments also include the production of an antibody against CALSIGN and 
other polypeptides of the invention, wherein said antibodies can be polyclonal or monoclonal. For 
25 the production of recombinant CALSIGN, an expression vector and a corresponding cell system 
will be used, wherein the expression system can be prokaryotic such as E.coli, and eukaryotic such 
as Baculovirus/insect cells, or mammalian systems as well-known in the art 
Protein of SEQ ID NO:58 (internal designation Clone 625004_188-15-4-0-H6-F) 

The cDNA of clone (SEQ ID NO:57) encodes the protein of SEQ ID NO:58, comprising 
30 the sequence: 

MGPPGFKGKTGHPGIJGPKGDCGKPGPPGSTGRPGAEGEPGAMGPQGRPGPPGHVGPPGP 
PGQPGPAGISAVGLKGDRGATGERGLAGLPGQPGPPGPQGPPGYGKMGATGMGQQGIPGI 
PGPPGPMGQPGKAGHCNPSDCFGAMPMEQQYPPMKTMKGPFG 

Accordingly, it will be appreciated that all characteristics and uses of polypeptides of SEQ 
35 ID NO:58 described throughout the present application also pertain to the polypeptides encoded by 
the nucleic acids included in clone 625004_188-15-4-0-H6-F. In addition, it will be appreciated 
that all characteristics and uses of the polynucleotides of SEQ ID NO:57 described throughout the 
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present application also pertain to the nucleic acids included in clone 625004_l88-15-4-0-H6-F. 
Also preferred are fragments having a biological activity as described herein and the 
polynucleotides encoding the fragments. 

The cDNA of SEQ ID NO:57 is a novel splice variant of the human alpha 1 type XVI 
5 collagen gene (GB M92642.1) located on chromosome 1, specifically in the p34-35 region. The 
cDNA clone of SEQ ID NO:57 encodes an open reading frame of 489 nucleotides. Whereas the 
native form of human alpha 1 type XVI collagen possess 71 exons encoding a 1603 amino-acid 
protein, the cDNA of SEQ ID NO:57 contains 14 exons and encodes a 163 amino-acid protein of 
SEQ ID NO:58. The present protein represents the first described variant of the human alpha 1 

10 type XVI collagen, named vCOL16Al. The present protein contains two collagen triple helix 
repeat domains (positions 1 1-70 and 73-131). 

Collagens represent a large family of structurally related proteins that to date includes 
more than 20 collagen types. These proteins constitute the major extracellular matrix components 
of connective tissues and play a dominant role in maintaining the structural integrity of various 

15 tissues and also have a number of other important functions. Collagens can be divided into two 
major classes: the fibril-forming collagens and the non-fibril-forming collagens; the latter class 
includes a subgroup named the fibril-associated collagens with interrupted triple helices (FACTT). 
The human alpha 1 type XVI collagen exhibits most of the characteristics of the proteins of the 
non-fibril-forming collagen class. 

20 In one embodiment, the protein of the invention or fragment thereof provide an in vitro 

assay to test the specific activity of various proteases which degrade or denature collagen, such as 
collagenases and many others. Methods to assess the activity of such proteases include the steps of 
contacting the protease to be tested with the present protein, and detecting the amount of 
proteolytic cleavage of the present protein that occurs. 

25 Since collagen fibrils are often heterogenous structures containing more than one collagen 

type, the present invention provides a method to determine the types of collagen present in a tissue 
or biological sample. For example, the collagen composition of a diseased tissue can be 
determined by isolating the present protein under conditions that do not disrupt protein-protein 
interactions, and determining the identity of proteins associated with the present protein. Such 

30 associated proteins can be identified by any standard method including, but not limited to, 
immunoprecipitation and immuno-affinity columns. 

The present invention also provides animal models generated by modulating the expression 
or activity of the present protein in one or more tissues of the animal. Such animals are useful for a 
number of purposes, for example because they represent an in vivo assay method for testing 

35 candidate molecules potentially useful for the treatment of various pathophysiological aspects of 
diseases associated with abnormal collagen metabolism specifically related to the activity of the 
present protein. Study of the phenotype of such models can also allow the identification of 
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additional human equivalent diseases caused by or linked with collagen mutations. These animals 
can be generated with any method of targeting overexpression or inactivation of the present 
protein. In one such embodiment, purified forms of the present protein are injected into the joints 
of an animal, or the protein is recombinantly expressed in the joints, to provoke "collagen induced 
5 arthritis" in the joints, a well known model for arthritis. Such models are extremely useful, e.g. in 
the assessment of candidate therapies and drugs for the treatment of arthritis and other 
inflammatory diseases and conditions. 

In other embodiment, the protein of the invention or fragment thereof is rised to diagnose 
diseases or disorders associated with abnormalities of the metabolism of collagen. Examples of 

10 such diseases and disorders include, but are not limited to, hereditary nephritis of Alport's type due 
to a defect in collagen assembly that lead to progressive renal failure, disorders of bone tissue 
comprising osteoporosis, Paget's disease, disorders of cartilage tissue occurring in arthritis (such as 
osteo-arthritis and rheumatoid arthritis), disorders of the cardiovascular system prominent in 
atherosclerosis, hypertension, myocardial infarction and hypertrophy. This method includes the 

15 steps of contacting a biological sample obtained from an individual suspected of suffering from the 
disease or condition, or at risk of developing the disease or condition, with a compound capable of 
selectively binding the present protein or nucleic acids, e.g. a polyclonal or monoclonal antibody or 
any immunologically active fragment thereof, a nucleic acid probe, etc., and detecting the level, 
spatial distribution, or any other detectable property of the present protein in the sample, where a 

20 difference in the level, spatial distibution, or other property in the sample relative to in a control 
sample indicates the presence of the disease or disorder, or of a propensity for developing the 
disease or disorder. 

A further embodiment of the present invention is to provide novel methods and 
compositions useful for the treatment of diseases and conditions associated with collagen matrix 

25 destruction, including for wound treatment, including fractures. Such methods comprise the 
administration of a therapeutically-effective amount of the present protein to a patient suffering 
from the disease or condition. Preferably, the protein is administered directly to the site of 
collagen matrix destruction. The methods and compositions can also be used in, for example, the 
restoration of surgically induced wounds, or for the correction of physiological malfunction, for 

30 example to control urinary incontinence and more specifically for intrinsic sphincter deficiency. In 
such methods, the present protein can be administered by peri-urethral injection to reduce lumen 
aperture. These compositions can comprise the protein of the invention, and, optionally, one or 
more other types of collagen, collagen derivatives, or any other compound of interest. All of these 
components may be either obtained from natural sources or produced by recombinant genetic 

35 engineering techniques and/or chemical modification. 

Since aberrant degradation of collagen is an indication of disorders of connective tissues, 
another embodiment the present invention is to provide an assay for the monitoring of collagen 
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degradation in vivo. The invention thus includes test kits useful for the quantification in a 
biological sample of the amount of collagen fragment derived from the degradation of collagen, i.e. 
the degradation of the present protein. The kits comprise at least one immunological binding 
partner, e.g. a monoclonal or polyclonal antibody specific for a peptide derived from the 

5 degradation of the present protein or the intact present protein and coupled to detectable markers. 
Collagen degradation can be measured effectively in plasma, serum or blood by any suitable 
method, including immunoassays. Thus, the condition of a subject can be monitored continuously 
and the quantified amount of collagen fragments measured in the pathological sample can be 
compared with the amount quantified in a biological sample of a normal individual. 

10 In this embodiment, the application of such assays can be used to monitor the progress of 

therapy administered to treat these or other conditions. Further, the assays can be used as a 
measure of toxicity, since the administration of toxic substances often results in tissue degradation. 
It can also be used during clinical testing of new drugs to assess the impact of these drugs on 
collagen metabolism. Thus the assays may be applied in any situation wherein the metabolic 

15 condition of collagen tissues can be used as an index of the condition, treatment, or effect of 
substances directly administered to the subject or to which the subject is exposed in the 
environment. 

Also in this embodiment, the present invention provides a method of detecting the 
presence and/or monitoring the metastatic progress of a malignancy. Indeed, metastatic potential 

20 can be influenced both positively and negatively by a variety of cell surface adhesive molecules 
that act both independently and in concert with connective tissue elements such as collagen, 
allowing subsequent growth of tumor cells at secondary sites in particular tissues. The invention 
thus includes test kits useful for quantify the amount of the present protein or any specifically 
associated collagen type in a biological sample comprising the steps of contacting the biological 

25 sample with a specific monoclonal or polyclonal antibody specific for the present protein or any 
specifically associated collagen type, and coupled to detectable markers. Thus, the condition of a 
patient can be monitored continuously and the quantified amount of such proteins measured in the 
pathological sample can be compared with the amount quantified in a biological sample of a 
normal individual or with the previous analysis of the same patient. 

30 Excessive production and deposition of collagen leads to fibrosis and thereby impairs the 

normal functioning of the affected organ and tissues. There are numerous examples of fibrosis, 
including the formation of scar tissue following a heart attack, which impairs the ability of the 
heart to pump. Diabetes frequently causes damage/scarring in the kidneys which leads to a 
progressive loss of kidney function. Even after surgery, scar tissue can form between internal 

35 organs causing contracture, pain, and in some cases, infertility. Thus, the present invention 
provides a method to inhibit collagen accumulation, specifically the accumulation of the present 
protein, and thereby to avoid delayed healing. The level of the present protein can be inhibited or 
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decreased using any of a number of methods, including using antisense molecules or ribozymes, or 
alternatively the activity of the present protein can be inhibited using direct or indirect inhibitor 
molecules or antagonistic antibodies directed against the present protein. The inhibition of the 
expression or the activity of the present protein is also useful in the treatment of acute fibrosis (in 
5 response to various forms of trauma including injuries, infections, surgery, burns, radiation, 
chemotherapy treatments) or in the treatment of chronic fibrosis of the most commonly affected 
organs (heart, liver, kidney, lung, eye and skin), e.g. induced by viral infection, diabetes, 
hypertension or other chronic conditions. 

In another embodiment, the invention is useful for preparing cosmetic compositions such 

10 as skin creams with anti-wrinkle activity. Cosmetic applications also include the use of the present 
invention as a dermal implant to increase tissue size by injections of collagenous suspensions 
following eyebrow uplift, for lip augmentation and to rectify facial defects, frown lines and acne 
scars. The present protein can be used as an injectible biomaterial as a dermal implant to increase 
tissue size for cosmetic (wrinkle reduction). The protein of the invention is held to be an ideal 

15 biomaterial due to its ability to persist in the body long enough to carry out its specific role without 
developing a foreign body response that could lead to the premature rejection or overall failure of 
the biomaterial. These compositions can comprise purified forms of the present protein and, 
optionally, one or more other types of collagen or collagen derivatives. All of these components 
may be either obtained from natural sources or produced by recombinant genetic engineering 

20 techniques and/or chemical modification. 

The present invention can also be used in a variety of applications as a food source. Since 
the transmission risks of bovine spongiform encephalopathy to humans from various commonly 
used bovine derived products, such as bovine collagen, are still unclear, there is a need for 
alternative products to replace bovine derived products. Thus, another advantage of the present 

25 invention is derived from the fact that it is a human collagen rather than an animal-derived 
collagen. It is useful for making a casing for food products that are usually sausages, but the 
present invention can also be applied to any type of material including animal meat, fish meat, 
shellfish, and fish eggs, such as salmon roe, cheese, noodles. In addition, the present protein can 
be used as the binder element instead of caseins, which have been considered in the art to be 

30 indispensable for obtaining satisfactory binding strength in bound food. Thus, consumers who are 
allergic to these proteins can enjoy the bound food prepared containing the present invention 
without the fear of having an allergic reaction. The use of the present invention is also attractive 
for pet food, for example dogs or cats, and can be even more so if it is combined with solid 
products conventionally used in animal nutrition, for example pieces of meat or fish, and/or 

35 extruded cereals and/or extruded proteins. In a such embodiment, the present invention can be 
deliverable as a mixture, including, but not limited to, in a fluidized state, as a mixture in a gel 
state, in a freeze-dried state, or in a salt-precipitated state. 
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In another embodiment, the present protein can be used as a biomaterial for tissue 
engineering, to regenerate or replace damaged tissues. The present invention thus provides various 
clinical applications for the generation of tissues or organs unable to repair or regenerate 
themselves. It can be used, for example, to promote bone regeneration, to repair tendons, 

5 ligaments or cartilage, to generate blood vessels or heart valves, to create dental implants, but also 
in burn injuries, for dermal replacement in chronically unstable scars, after skin loss for hereditary, 
traumatic or oncological reasons, or for corneal reconstruction (see, e.g. Atala, (2000) J Endourol 
Feb;14(l):49-57; Schwartzmann (2000) Implant Dent 9(l):63-6; Machens et al.(2000) Cells 
Tissues Organs 167(2-3):88-94; the disclosures of which are hereby incorporated by reference in 

10 their entireties). The present invention is suited to the culturing of three-dimensional mammalian 
tissues for purposes including transplantation or implantation in vivo, and as the primary 
component of an extracorporeal organ assist device. Methods are also provided involving stem 
cells, for example pluripotential cells, which can differentiate into various tissue types (muscle, 
cartilage, skin, bone, etc) when stimulated by an appropriate environment, e.g. comprising the 

1 5 present protein. For example, stem cells can be expanded in vitro and suspended in collagen gel 
. matrices to form composites. The resulting composites will be implanted in a gap defect as a graft, 
which after remodeling in vivo, becomes populated with host cells and recapitulates normal 
functional architecture. In this embodiment, these substitutes can also serve as in vitro models for 
toxicology testing to better understand the response and healing mechanisms in human tissues. 

20 Protein of SEQ ID NO:60 (internal designation Clone 422353_145-ll-3-0-E7-F) 

The cDNA of SEQ ID NO:59 encodes the protein of SEQ ID NO:60, comprising the sequence: 
MCFPKVLSDDMKKLKARMHQAJERFYDm 

AAYYEEQHPELTPLLEKERDGL^ • 
RLQTWWHGVLAWVKEK^ 

25 LTPQKCSEPQSSK. Accordingly, it will be appreciated that all characteristics and uses of the 
polypeptide of SEQ ID NO:60 described throughout the present application also pertain to the 
polypeptide encoded by the nucleic acids included in clone 422353_145-1 1-3-0-E7-F. In addition, 
it will be appreciated that all characteristics and uses of the nucleic acid of SEQ ID NO:59 
described throughout the present application also pertain to the nucleic acids included in clone 

30 422353_145-1 1-3-0-E7-F. A preferred embodiment of the invention is directed toward the 
compositions of SEQ ID NO:59, SEQ IDNO:60, and Clone 422353 J45-1 1-3-0-E7-F. Also 
preferred are polypeptide fragments having a biological activity as described herein and the 
polynucleotides encoding the fragments. 

The protein of SEQ ID NO:60 (NK5) is a novel splice variant of the human Natural Killer 

35 cells protein 4 precursor (NK4) (Genbank accession number M59807). NK5 is a 188-amino-acid- 
long protein that displays an RGD cell-attachment sequence from positions 170 to 172. An epitope, 
located from positions 163 to 187, overlaps this RGD motif. NK5 displays a putative trans- 
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membrane domain from positions 148 to 168. Contrarily In contrast to NK4, NK5 displays no 
signal peptide. The NK4 cDNA contains 6 exons (Bernot et alet a/., Genomics 50:147-60 (1998)), 
whereas the NK5 cDNA contains 7 exons. Exons 1 and 2 are identical for NK4 and NK5, and 
exons 5, 6 and 7 of NK5 are identical to exons 4, 5 and 6 of NK4. Exon 3 of NK5 is shorter than 
5 exon 3 of NK4, and exon 4 is unique for NK5 . 

NK4 gene expression is ubiquitous (Bemot et al, Genomics 50:147-60 (1998)). 
Nevertheless, its expression is greatly increased in mitogen-activated T cells and in EL-2-activated 
Natural Killer cells (Dahl et al, J. Immun. 148:597-603 (1992)). 

Natural killer (NK) cells and T cells provide anti-infectious, antineoplastic, and 

10 immunomodulatory function effected by both cytokine production and direct cellular cytotoxicity. 
In particular, NK cells play a primary role in preventing and removing cancer cells in the body, 
removing many types of viruses (including herpes and measles) and have been found to be present 
at low levels in women with endometriosis. Moreover, in addition to these overtly immuno- 
protective functions, NK cells also mediate a variety of homeostatic functions, particularly in the 

1 5 regulation of haematopoesis and they may have an important role to play in the maintenance and 
development of placentation. The behaviour of NK and T cells in these various situations is 
regulated by a large number of distinct receptors that transmit positive and negative signals. 
Resting NK and T cells express a number of surface molecules which, when stimulated, can 
activate the cytotoxic mechanism. The balance of these signals determines whether an NK or T 

20 cell does nothing or is activated to proliferate, kill or secrete a wide range of cytokines and 
chemokines. More particularely, IL-2 activates many NK-cell functions, including baseline or 
"natural" anti-tumor cytotoxicity, antibody-dependent cellular cytotoxicity (ADCC), proliferation, 
and cytokine production (Trinchieri, Adv. Immunol. 47:1 87-376 (1989)), and IL-2-activated NK 
cells display a broader spectrum of reactivity against human and murine tumor target cells. 

25 The RGD motif, which is found in a number of proteins, has been shown to play a role in cell 
adhesion. It was shown that anchorage of NK cells is necessary for full activation (Li et alet al, J 
Immunother 20: 123-30 (1997)), and thatlong term-activated NK cells acquire new adhesive 
properties. This suggests a central role for RGD recognition in the regulation of immune 
responses. 

30 The expression of the NK5 gene is greatly increased in IL2-activated NK cells and in 

mitogen mitogen-activated T cells, and thus likely plays an important role in lymphocyte 
activation. In particular, NK5 is believed to play a role in the new adhesive properties that are 
acquired by activated lymphocytes. As NK5 does not display a signal peptide, NK5 likely plays a 
distinct role from NK4 in this process. 

35 An embodiment of the present invention relates to methods of using NK5 or fragment 

thereof as a marker to selectively detect and/or quantify activated T cells and/or activated NK cells. 
Any method of detecting the presence, level, or activity of NK5 can be used in such methods. For 
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example, the protein of the invention or fragment thereof may be used to generate specific 
antibodies using standard methods, and the antibodies can be used to detect the level of the present 
protein in a NK cell or a T cell, wherein a detection of a higher level of the present protein in the 
cell compared to a control level representative of a resting T cell or NK cell indicates that the cell 
5 is activated. Preferably, the antibodies are either directly or indirectly labeled, and bind more 
specifically to NK5 than to related proteins such as NK4. Alternatively, the nucleic acid of the 
invention or fragment thereof may be used to synthesize specific probes using any technique 
known to those skilled in the art. Such antibodies and/or probes may then be used in assays and 
diagnostic kits for the detection and/or quantification of activated T cells and/or activated NK cells 

10 in, e.g., bodily fluids, in tissue samples and in mammalian cell cultures. 

In a preferred embodiment, such methods of detecting the polypeptides or polynucleotides 
of the invention, e.g. using specific antibodies and/or probes, can be used to measure the effect of a 
test compound on T cell and/or NK cell activity in mammalian cell cultures. In another preferred 
embodiment, such methods can be used to monitor the effects of a treatment aiming to increase or 

15 decrease T cell and/or NK cell activity in a patient, or to detect the beginning of a graft rejection 
reaction in a patient. 

Another embodiment of the invention relates to compositions and methods for inhibiting 
the expression or activity of NK5 in a patient for the treatment or prevention of diseases and 
disorders caused as a result of T cell and/or NK cell activation. The inhibition and/or reduction of 

20 T cell and/or NK cell activation can be achieved using any suitable method, e.g. through the 
administration of a therapeutically effective amount of an antibody that specifically recognizes 
NK5 or fragment thereof to a patient. Preferably, the antibody recognizes the epitope overlapping 
the RGD domain. The antibody can be administered alone or in combination with one or more 
agent known in the art, e.g. other immuno-suppressive agents. Administration of the antibody can 

25 be done following any method known in the art, including those described in U.S. Patent 

5,817,31 1, which disclosure is hereby incorporated by reference in its entirety. Other inhibitors of 
NK5 expression or activity which can be used include, but are not limited to, antisense molecules, 
ribozymes, dominant negative forms of NK5, and compounds that decrease the activity or 
expression of NK5 in a cell. Such compounds can be readily identified, e.g. by screening test 

30 agents against T cells or natural killer cells expressing NK5, or capable of expressing NK5, and 
detecting the ability of the test agents to inhibit natural killer cell or T cell activation, or to 
diminish die level of NK5 expression. Diseases and disorders caused as a result of T cell and/or 
NK cell activation include, but are not limited to, allergy and asthma, and the methods can also be 
used in treatments for preventing and/or inhibiting on-going immune responses. More particularly, 

35 such treatments can be used to prevent, or inhibit, or Teduce in severity graft rejection, or induce 
tolerance to graft transplantation. Such transplantation may by way of example include, but not be 
limited to, transplantation of cells, bone marrow, tissue, solid-organ, bone, etc. Such treatments 
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can also be used to prevent or reduce in severity graft versus host diseases and autoimmune 
diseases, which by way of example include but are not limited to rheumatoid arthritis, systemic . 
lupus, multiple sclerosis, insulin-dependent diabetes, hepatitis, rheumatoid arthritis, Graves 
disease, etc. 

5 Another embodiment of the invention relates to the activation and/or prevention of 

inactivation of NK and/or T cells, based on compositions and methods containing, e.g., NK5 or 
fragment thereof, a polynucleotide encoding the protein, or a compound that increases the 
expression or activity of NK5. Such compounds can be readily identified, e.g. by screening test 
agents against T cells or natural killer cells expressing NK5, or capable of expressing NK5, and 

10 detecting the ability of the test agents to enhance natural killer cell or T cell activation, or to 

increase the level of NK5 expression. Diseases and disorders that may be treated and/or reduced in 
severity by T cell and/or NK cell activation include but are not limited to tumors, viral infections, 
inflammation, or conditions associated with impaired immunity, bacterial infections, hepatic 
dysfunction, liver regeneration, haematopoesis and maintenance and development of placentation. 

1 5 More particularity, such treatments can be used to treat proliferative disorders (including various 
forms of cancer such as leukemias, lymphomas, sarcomas, melanomas, adenomas, carcinomas of 
solid tissue, hypoxic tumors, squamous cell carcinomas, genitourinary cancers, hematopoietic 
cancers, head and neck cancers, and nervous system cancers, benign lesions such as papillomas, 
atherosclerosis, angiogenesis), viral infections (in particular HBV, HCV, HIV, hepatitis, measles 

20 and herpes viruses infections, as well as other viral-induced infections), and other various immune 
deficiencies. These immune deficiencies may be genetic (e. g. rheumatoid and osteo arthritis and 
severe combined immunodeficiency (SCBD)) or be caused by various bacterial or fungal infections 
(e.g. infections by mycobacteria, Leishmania spp., malaria spp. and candidiasis). Of course, NK5 
may also be useful where a boost to the immune system generally may be desirable, i.e., in 

25 radiation therapy or chemotherapy when treating the cancer. NK5 or fragment thereof can be 
administered alone or in combination with other known agents capable of activating NK and/or T 
cells, such as methods described in U.S. Patent 6,245,563 and in U.S. Patent 6,197,302, which 
disclosures are hereby incorporated by reference in their entireties. 
Protein of SEQ ID NO:62 (Internal Designation Clone 500715621_204-15-3-0-C6-F) 

30 The cDNA of Clone 500715621_204-15-3-0-C6-F (SEQ ID NO:61) encodes the 202 

amino acid long polypeptide of SEQ ID NO:62 comprising the amino acid sequence : 
MELWGAYLLLCLFSLLTQVTO 
LKEQQALQWCLKGTKVHMKCFLAFTQTKTFHESS 
LRQSVGNEAEIWLGLNDMAA^ 

35 AANGKWFDKRCRDQLPYICQFGIV. Accordingly, it will be appreciated that all characteristics 
and uses of polypeptides of SEQ ID NO:62 described throughout the present application also 
pertain to the polypeptides encoded by the nucleic acids included in Clone 500715621_204-15-3- 
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0-C6-F. In addition, it will be appreciated that all characteristics and uses of the polynucleotides of 
SEQ ID NO:61 described throughout the present application also pertain to the nucleic acids 
included in Clone 500715621J204-15-3-0-C6-F. A preferred embodiment of the invention is 
directed toward the compositions of SEQ ID NO:61, SEQ ID NO:62, and Clone 50071 562 1_204- 

5 15-3-0-C6-F. Also preferred are polypeptide fragments having a biological activity as described 
herein and the polynucleotides encoding the fragments. 

The protein of SEQ ID NO:62 represents a new variant form of the human tetranectin 
precursor polypeptide (Swissprot entry P05452), harboring an amino acid substitution at position 
94 which replaces an alanine residue by a serine residue. The protein of the SEQ ID NO:62 is a 

10 202 amino acid long polypeptide comprising a 21 amino acid signal peptide followed by a 1 81 
amino acid sequence corresponding to a mature polypeptide of the invention, Plasminogen carrier 
protein (PLCP). 

PLCP is a 68 kilodalton homotrimeric plasminogen-binding protein present in plasma. In 
addition to plasminogen, PLCP binds calcium as well as a number of sulphated polysaccharides 
1 5 including heparin, chondroitin and fucoidan. It also binds Apoliprotein A and fibrin. 

In terms of primary and tertiary structure, the protein is related to the family of Ca(2+)- 
binding C-type lectins, proteins that bind a wide diversity of compounds, including carbohydrates, 
lipids and proteins. 

The protein is encoded by three exons corresponding to three functional domains. Exon 3 

20 (nt367 to nt771 on SEQ ED NO: 61) encodes the long-form C-type Lectin domain (aa77 to aal98 . 
on SEQ ID NO:62), also termed the carbohydrate recognition domain (CRD), which is involved in 
Ca(2+) and plasminogen binding. Exon 2 (nt268 to nt366) encodes an alpha-helix domain that 
governs the trimerization of PLCP oligomers by assembling into a triple helical coiled-coil 
structural element. Finally, residues encoded by exonl (ntl3 to nt267), but not the CRD, bind 

25 heparin, suggesting a specific role for this domain in sulphated carbohydrate ligand binding 
(Lorentsen et al. 2000, Biochem. J. 347, 83-87 which disclosure is hereby incorporated by 
reference in its entirety). 

PLCP binds plasminogen via its CRD through a specific interaction with the fourth kringle 
domain of plasminogen, and binding has been reported to facilitate the proteolytic activation of 

30 plasminogen to plasmin by the tissue-type plasminogen activator. Because plasminogen activation 
is involved in a variety of extracellular proteolytic events including fibrinolysis, cell migration, 
angiogenesis, tumor cell invasion, inflammation, wound healing, and tissue remodeling, PLCP is 
useful in the modulation of these biological processes. 

The present protein is isolated from human blood, but is also found to be deposited in the 

35 extracellular matrix of various tissues. In particular, PLCP is deposited in the tumor surrounding 
stroma of breast, colon, and ovarian tumors and is found to co-localise with plasmin/plasminogen 
at the invasive front of cutaneous melanoma lesions, whereas little or no PLCP is found in the 
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corresponding normal tissues. Plasma PLCP level is reduced in cancer patients, and PLCP is 
useful as a prognostic marker for the diagnosis of certain types of cancer. 

Preferred PLCP polypeptides for uses in the methods described below include the 
polypeptides comprising the amino sequence of: 
5 EPPTQKPKKIVNAKKDV^ 

CFLAFTQTKTFHESSEDCISRGGTLSTPQTGSENDALYEYLRQS 

GTWVDJVTTGAIUAYKNWETEITAQPDGGKTEN 

GIV; 

A polypeptide comprising the amino acid sequence of: 
10 VCLKGTKVHMKCFLACT 

EIWLGLNDMAAEGTWVDMTGARIAYKNWETEff 
KRCRDQLPYICQFGIV; 

A polypeptide comprising the amino acid sequence of: 
VHMKCFLAFTQTKTFHE 
15 MAAEGTWVDMTGARIAYKNWETEITAQPDGGKTENCAVL 
YICQ 

In one embodiment, the cDNA of SEQ ID NO:61 bearing a G to T substitution at position 
438, which replaces an alanine residue by a serine at position 94 of SEQ ID NO: 62, is used for 
DNA genotyping. Indeed genotyping this locus could be of interest in DNA fingerprinting for 

20 paternity studies or forensic analyses. It could also be used for genetic association studies, 
especially in pathologies relating to coagulation disorders. 

In another embodiment, the polynucleotide sequence of the invention is used in 
pharmacogenomic applications in order to aid in the choice of the ideal drug (e.g. a coagulation or 
anticoagulation drug), or dosage of a drug, for the treatment of a condition or disease in a patient. 

25 For example, in one embodiment, the invention provides a method of genotyping the patient to 
determine the identity of the nucleotide encoding the amino acid at position 438 of SEQ ID NO:62, 
and administering to the patient a drug or a dosage of the drug that has been established to be 
preferentially efficacious in those with a serine residue at position 438 (e.g. because of preferential 
binding of the drug to the isoform of the protein with a serine at that position). In another 

30 embodiment, the patient is genotyped for the nucleotide encoding amino acid position 438, and a 
drug is determined to be not desirably administered to the patient, e.g. because side effects are 
known to be associated with the administration of the drug to individuals with a serine at position 
438. 

In another embodiment, the present protein is used to copurify plasminogen from a 
35 biological sample, preferably from a liver cell extract. This is achieved using any method, a large 
number of which are known in the art. For example, plasminogen is purified using affinity column 
chromatography with the protein of SEQ ID NO:62 or by coimmtmopurification using a 
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monoclonal or polyclonal antibody that specifically binds the protein of the invention. Purified 
plasminogen is useful for many purposes, including for the preparation of therapeutic fibrinolytic 
compositions. 

In a further embodiment, the present protein provides a method to purify a protein 

5 harboring one or more kringle domains from a cellular extract, the method comprising using a 
fragment of the present protein retaining an intact CRD domain, preferably a fragment restricted to 
the CRD domain itself, to purify the kringle domain-containing protein, e.g. using a method such 
as affinity chromatography. Preferably, the protein to be purified is selected from the group 
consisting of plasminogen, angiostatin, thrombin, Hepatocyte Growth Factor, Macrophage 

10 Stimulating Protein and apolipoprotein a. The protein to be purified using the present method is 
derived from any source, e.g. protein expressed in vitro using an invertebrate, yeast or bacterial 
heterologous expression system. 

In another embodiment, the present protein provides a method to determine the localization 
of plasminogen in vivo or ex vivo. In one such method, a tissue section is contacted with a labeled 

15 protein of SEQ ID NO:62, and the labeling in the tissue section is detected. Plasminogen can also 
be detected directly from crude cell or tissue extracts using the protein of the invention. Methods 
for labeling proteins are well known in the art, any of which is used in the present invention. 

In another embodiment, the protein of SEQ ID NO:62 is used to determine circulating 
levels of plasminogen in the blood of an individual, the method comprising obtaining a blood 

20 sample from the individual, using the protein of the invention to copurify plasminogen from the 
blood sample (e.g. by affinity column chromatography), and measuring the level of plasminogen in 
the sample using methods well known in the art, for example Elisa, western blot or 
radioimmunoessay (RIA). Determining plasminogen levels in circulating blood could be of special 
interest for the monitoring of patients with diseases associated with impaired coagulation or 

25 fibrinolysis. 

In another embodiment, the present protein is used as a diagnostic or pronostic marker for 
breast cancer, ovarian cancer, colon or colorectal cancers, the method comprising contacting a 
blood sample from a patient, preferably a serum sample, with an antibody directed to the present 
protein, and determining the level of PLCP in the sample compared to a control level 

30 representative of a healthy patient, wherein a lower level of PLCP in the patient sample relative to 
the control level indicates that the patient has the disease, is at an elevated risk of developing the 
disease, or has a worse prognosis that a patient with normal levels of the protein. The antibody 
used is either monoclonal or polyclonal and is labeled directly or indirectly for quantification of 
immune complexes by methods well known to those skilled in the art. 

35 In another embodiment, the present protein provides a transgenic animal, preferably a 

mammal, more preferably a rodent, with impaired fibrinolytic activity due to no or reduced 
expression of the protein of SEQ ID NO:62. Such transgenic animals provide a powerful model in 
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which to study pathologies associated with defective fibrinolysis, especially fibrosis and 

thrombosis. In addition, such animal is used to screen candidate molecules for the ability to inhibit 

coagulation or fibrosis. 

Transgenic animals with reduced or eliminated PLCP expresssion or activity is obtained 
5 using any of a number of ways, including by PLCP gene knock-out, for example in the mouse, 

using DNA microinjection into fertilized eggs or transfection of embryonic stem cells. 

Alternatively, low level expression of the present protein is achieved using antisens methods, e.g., 

by placing the reverse nucleotide sequence encoding the protein of SEQ ID NO:62 under the 

control of a strong promoter sequence. Preferably, a regulatable and ubiquitous promoter sequence 
10 is used in order to temporally control the expression of the genetic construct once introduced into 

the animal. Other methods suitable for use in the present methods include the use of ribozymes, 

antibodies, and dominant negative forms of the present protein. 

In another embodiment, the present protein provides a method to increase fibrinolysis in an 

individual, the method comprising administering to said individual an amount of the present 
1 5 protein sufficient to increase plasminogen activation. The present protein is administered in any of 

a number of ways, including by intravenous injection. Such methods is used in order to eliminate 

clots in the prevention or the treatment of cardiovascular diseases including, but not limited to, 

strokes or pulmonary embolisms. 

Protein of SEQ ID NO:64 (Internal Designation Clone 165843_116-008-4-0-G4-F) 

20 The cDNA of Clone 1 65 843_1 16-00 8-4-0-G4-F (SEQ ID NO:63) encodes Novel . 

Calpastatin 1 (NCI) protein of SEQ ID NO:64, comprising the amino acid sequence: 
MTVLEITLAVILTLLGLAILAILLTRWARRKQSEMffl 
TQSERSKRDYTPSTNSLALSRSSIALPQGSMSSIKCLQTTEELPSRTAGAM 
FALLNC. Accordingly, it will be appreciated that all characteristics and uses of the polypeptides 

25 of SEQ ID NO:64 described throughout the present application also pertain to the polypeptides 
encoded by the nucleic acids included in Clone 165843_1 16-008-4-0-G4-F. hi addition, it will be 
appreciated that all characteristics and uses of the polynucleotides of SEQ ID NO:64 described 
throughout the present application also pertain to the nucleic acids included in Clonel65843_l 16- 
008-4-0-G4-F. A preferred embodiment of the invention is directed toward the compositions of 

30 SEQ ID NO:63, 64 and Clone 165843_1 16-008-4-0-G4-F. Also preferred are polypeptide 
fragments having a biological activity as described herein and the polynucleotides encoding the 
fragments. 

NCI is a physiological inhibitor of calpains. Calpains, a group of ubiquitous Ca2+ - 
activated cytosolic proteases, have been implicated in cytoskeletal remodeling events, cellular 
35 adhesion, shape change, and mobility involving site-specific regulatory proteolysis of membrane- 
and actin-associated cytoskeletal proteins and apoptosis [Beckerle et al., Cell 51:569-577, 1987; 
Yao et al., Am. J. Physiol. 265(pt. l):C36-46, 1993; and Shuster et al., J. Cell Biol. 128:837-848, 
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1995; Squier et al., J. Cell Physiol., 178(3): 311-319, 1999]. Calpains have also been implicated in 
the pathophysiology of cerebral and myocardial ischemia, platelet activation, NF-kB activation, 
Alzheimer's disease, muscular dystrophy, cataract progression and rheumatoid arthritis. There is 
considerable interest in inhibitors of calpain, as cellular adhesion, cytoskeletal remodeling events 

5 and cell mobility are linked to numerous pathologies (Wang et al., Trends in Pharm. Sci. 15:412- 
419, 1994; Mehdi, Trends in Biochem. Sci. 16:150-153, 1991). In addition, as the 
calpain/calpastatin system is involved in membrane fusion events for several cell types, and 
calpain can be detected in human sperm and testes extracts by Western blotting with specific 
antisera, tCAST may modulate calpain in the calcium-mediated acrosome reaction that is required 

10 for fertilization (Li S et al., Biol Reprod, 63(l):172-8, 2000). 

NCI has a unique N-terminal domain (domain L) and four repetitive protease-inhibitor 
domains (domains I-IV) (Lee WJ et al., J Biol Chem, 267(12):8437-42, 1992). The protein of SEQ 
ED NO:64 has calpastatin domains T and II. The T domain targets cytosolic localization and 
membrane association, whereas domain I exhibits a nuclear localization function. 

15 NCI plays a role in cytoskeletal remodeling events, cellular adhesion, shape change, and 

mobility by the site-specific regulatory proteolysis of membrane- and actin-associated cytoskeletal 
proteins. Preferred polypeptides of the invention are polypeptides comprising the amino acids of 
SEQ ID NO:64 from positions 1 to 1 16. Also preferred are fragments of SEQ ID NO:64 having a 
biological activity as described therein and the polynucleotides encoding the fragments. 

20 One embodiment of the present invention relates to methods of using the protein of the 

invention or fragment thereof in assays to detect the presence of calpain in a biological sample, 
such as in bodily fluids, in tissue samples, or in mammalian cell cultures. As NCI binds calpain 
(Murachi, Biochemistry Lit., 18(2)263-294, 1989), the protein of the invention can be used in 
assays and diagnostic kits to test the presence of calpain using techniques known to those skilled in 

25 the art. Preferably, a defined quantity of the protein of the invention or fragment thereof is added 
to the sample under conditions allowing the formation of a complex between the protein of the 
invention or fragment thereof and heterologous proteins, and the presence of a complex and/or the 
free protein of the invention or fragment thereof is assayed and compared to a control. NCI is 
useful as a marker of intracellular calpain activation, and can be used for monitoring the 

30 involvement of calpain in pathological situations (De Tullio et al., FEBS letter, 475(1): 17-21, 
2000). Calpain has been implicated in cytoskeletal protein degradation involved in the 
pathophysiology of ischemia and disorders like Alzheimer's disease (Wronski et al., J. Neural 
transm., 107(2):145-157, 2000) and Parkinson's disease (Mouatt-Prigent et al., J. Comp. Neurol., 
419:175-92, 2000), apoptosis in neural cells of rat with spinal cord injury (SCI) (Ray, Brain res., 

35 867(l-2):80-9, 2000), cell fusibility (Kosower et al., Methods Mol Biol., 144:181-94, 2000) and 
other physiopathologies. Assays detecting any increased or decreased calpain levels in a cell are 
thus useful in the diagnosis of any of these diseases or conditions. In addition, a recent study 
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showed that in addition to their proteolytic activities on cytoskeletal proteins and other cellular 
regulatory proteins, calpain-calpastatin systems can also affect expression levels of genes encoding 
structural or regulatory proteins (Chen et aL, Am. J. Physiol. Cell Physiol, 279:C709-C716, 2000). 
Thus, the ability to detect NCI and calpain levels is also useful for the diagnosis of an even larger 
5 number of diseases and conditions. 

In another embodiment, the polynucleotides or polypeptides of the invention may be used 
for the detection of gametes, gametic precursor cells (such as spermatogenic stem cells), or of 
specific structures within the gametes, using any technique known to those skilled in the art, 
including those involving the use of specific antibodies and nucleic acid probes. The ability to 

10 visualize spermatozoa generally, or the sperm acrosome in particular, has obvious utility for a 
number of applications, including for the analysis of infertility in patients. 

Another embodiment of the present invention relates to a method of inhibiting calpain in a 
cell, the method comprising administering to the cell an amount of the present protein sufficient to 
inhibit calpain in the cell. Such methods can be performed in vitro or in vivo. The inhibition of 

15 calpain has numerous uses in the treatment or prevention of various diseases and conditions, for 
example the pathophysiology of cerebral, myocardial, renal ischemia, platelet activation, NF-kB 
activation, Alzheimer's disease, Parkinson's disease, muscular dystrophy, cataract progression, 
cancer cachexia and rheumatoid arthritis. Such an increase can be effected in any of a number of 
ways, including, but not limited to administering purified protein of the invention directly to the 

20 cells, transfecting the cells with a polynucleotide encoding the protein, operably linked to a 
promoter; and administering to a cell a compound that increases the activity or expression of the 
protein of the invention. In addition, the expression or activation of the protein of the invention 
can be inhibited in any of a large number of ways, including using antisense oligonucleotides, 
antibodies, dominant negative forms of the protein, and using heterologous compounds that 

25 decrease the expression or activation of the protein. Such compounds can be readily identified, 
e.g. by screening candidate compounds and detecting the level of expression or activity of the 
protein using any standard assay. Other calpain inhibitors are also known which can be used in 
conjunction with the present protein, or which can be used as controls in the identification of 
additional inhibitors or activators of calpastatin. Such inhibitors include, but are not limited to, 

30 cerebrolysin (Wranski et al., J. Neural Transm. Suppl., 59:263-272, 2000), E-64-D (Ray et al., 
Brain Res., 867(1-2): 80-9, 2000), and the calpain active site inhibitor N-acetyl-leucyl-leucyl- 
norleucinal (Squier et al., J. Cell Physiol., 178(3): 311-319, 1999). 

In still another embodiment, the protein of SEQ ID:64 or fragment thereof can be used to 
prevent cells from undergoing apoptosis. Specifically, any method of increasing the level or 

35 activity of the present protein in cells can be used to prevent the cells from undergoing apoptosis, 
in vitro or in vivo. For example, a polynucleotide encoding a protein of SEQ ID NO:64, or any 
fragment or derivative thereof, can be introduced into cells, e.g. in a vector, wherein the protein is 
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expressed in the cells. Alternatively, a protein of SEQ ID NO:64 itself can be administered to 
cells, preferably in a formulation that leads to the internalization of the protein by the cells. Also, 
any compound that increases the expression or activation of the proteins within the cells can be 
administered. Preventing cells from undergoing apoptosis can be used for any of a large number of 
5 purposes, including, but not limited to, to prevent the death of cells being grown in culture, to 
prevent in a patient the apoptosis associated with any of a number of disorders, or to prevent 
apoptosis in cells of a patient undergoing a treatment that increases the level of cellular stress, such 
as chemotherapy. Furthermore, the invention relates to methods and compositions using the 
protein of the invention or fragment thereof to diagnose, prevent and/or treat disorders 

10 characterized by abnormal cell proliferation and/or programmed cell death, including but not 
limited to cancer, immune deficiency syndromes (including ADDS), type I diabetes, pathogenic 
infections, cardiovascular and neurological injury, alopecia, aging, degenerative diseases such as 
Alzheimer's Disease, Parkinson's Disease, Huntington's disease, dystonia, Leber's hereditary optic 
neuropathy, schizophrenia, and myodegenerative disorders such as "mitochondrial encephalopathy, 

15 lactic acidosis, and stroke" (MELAS), and "myoclonic epilepsy ragged red fiber syndrome" 

(MERRF). For diagnostic purposes, the expression of the protein of the invention can be detected 
using any method such as Northern blotting, RT-PCR or immunoblotting methods, and compared 
to the expression in control individuals, wherein an increase or decrease of the level of the present 
protein compared to the control level indicates the presence of the disease or condition, or of a 

20 propensity for the disease or condition. For prevention and/or treatment purposes of disorders in 
which cell proliferation needs to be reduced and/or apoptosis increased, the expression of protein 
of the invention may be enhanced using any method, for example administering the purified 
protein to cells, transfecting the cells with a polynucleotide encoding the protein, or administering 
to the cells a compound that increases the expression or activity of the protein. For prevention 

25 and/or treatment purposes of disorders in which cell proliferation needs to be enhanced and/or 
apoptosis reduced, inhibition of endogenous expression of the protein of the invention may be 
achieved using any method , including triple helix and antisense strategies. 

In another embodiment, inhibiting the proteins of the invention can be used to induce 
apoptosis in undesired cells. Such inhibition can be accomplished in any of a number of ways, 

30 including, but not limited to, using antibodies, antisense sequences, dominant negative forms of the 
protein, or small molecule inhibitors of the expression or activity of the proteins. Such induction 
of apoptosis can be used to eliminate any undesired cells, for example cancer cells, in a patient. 
Preferably, such inhibitors are targeted specifically to the undesired cells in the patient using 
standard methods. 

35 In another preferred embodiment, the protein of the invention can be used to modulate 

and/or characterize fertility, including for the treatment or diagnosis of infertility, and for 
contraception. As NCI is involved in the acrosomal reaction which is a required step in 
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fertilization, over- or under-expression or activation of the present protein can be used to disrupt 
this reaction and thereby inhibit fertility. For example, for contraception, the expression or 
activation of the protein can be artificially disrupted, for example by increasing the protein level 
using polynucleotides encoding the protein, using the protein itself, or using activators of protein 
5 expression or activity, or by decreasing the protein level using inhibitors such as antisense 
oligonucleotides, antibodies, dominant negative forms of the protein, and using heterologous 
compounds that inhibit protein expression or activity. Similarly, the cause of infertility in many 
patients can be detected by detecting the level of expression of the present protein, where an 
abnormal level of activity or expression of the protein indicates that a cause of infertility involves 

10 the calpain-dependent acrosomal reaction. Such a diagnosis would also point to methods of 
treating the infertility, e.g. by increasing or decreasing the expression or activation of the present 
protein in spermatozoa. 

In another embodiment, the invention relates to methods and compositions using the 
protein of the invention or fragment thereof as a marker protein to selectively identify tissues, 

15 preferably testis, or to distinguish between two or more possible sources of a tissue sample on the 
basis of the level of the protein of SEQ ID NO:64 in the sample. For example, the protein of SEQ 
ID NO:64 or fragments thereof may be used to generate antibodies using any techniques known to 
those skilled in the art, including those described therein. Such tissue-specific antibodies may then 
be used to identify tissues of unknown origin, for example, forensic samples, differentiated tumor 

20 tissue that has metastasized to foreign bodily sites, or to differentiate different tissue types in a 
tissue cross-section using immunochemistry. In such methods a tissue sample is contacted with the 
antibody, which may be detectably labeled, under conditions which facilitate antibody binding. 
The level of antibody binding to the test sample is measured and compared to the level of binding 
to control cells from testis or tissues other than testis to determine whether the test sample is from 

25 testis. Similar methods can be used to specifically detect cells expressing the protein, as well as to 
specifically isolate cells expressing the protein or to isolate the protein itself. For example, an 
antibody against the protein of SEQ ID NO:64 or a fragment thereof may be fixed to a solid 
support, such as a chromatography matrix. A preparation containing cells expressing the protein of 
SEQ ID NO:64 is placed in contact with the antibody under conditions which facilitate binding to 

30 the antibody. The support is washed and then the protein is released from the support by 
contacting the support with agents which cause the protein to dissociate from the antibody. 

Alternatively, the level of the protein of SEQ ID NO:64 in a test sample may be measured 
by determining the level of RNA encoding the protein of SEQ ID NO:64 in the test sample. RNA 
levels may be measured using nucleic acid arrays or using techniques such as in situ hybridization, 

35 Northern blots, dot blots or other techniques familiar to those skilled in the art. If desired, an 
amplification reaction, such as a PCR reaction, may be performed on the nucleic acid sample prior 
. to analysis. The level of RNA in the test sample is compared to RNA levels in control cells from 
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testis or tissues other than testis to determine whether the test sample is from testis. For a number 
of disorders listed above, particularly of inflammatory processes, expression of the genes encoding 
the polypeptide of SEQ ID NO;64 at significant higher or lower levels may be routinely detected in 
certain tissues or cell types (e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum, 
5 plasma, synovial fluid, and spinal fluid) or another tissue of cell sample taken from an individual 
having such a disorder, relative to the standard gene expression level, i.e., the expression level in 
healthy tissue or bodily fluid from an individual not having the disorder. 

In another embodiment, the invention relates to methods for using the protein of the 
invention or fragments to identify autoantibodies which indicate inflammatory processes and 

10 particularly, rheumatoid arthritis (RA), a systemic disease characterized by chronic polyarthritis 
and joint destruction, and in which high levels of autoantibodies directed against calpastatin have 
been identified. Accordingly, the present protein may be used to detect the presence and/or the 
localization of autoantibodies in a cell, hi a typical embodiment, the protein of SEQ ID NO:64 is 
labeled with any detectable moiety including, but are not limited to, a fluorescent label, a 

1 5 radioactive atom, a paramagnetic ion, biotin, a chemiluminescent label or a label which can be 
detected through a secondary enzymatic or binding step. The invention further provides a method 
of diagnosing inflammatory processes, e.g. rheumatoid arthritis, and distinguishing such processes 
from other diseases. 

Protein of SEQ ID NO:66 (Internal Designation 335752_157-15-4-0-Bll-F) 

20 The cDNA of Clone 335752_157-15-4-0-Bl 1-F (SEQ ID NO:65) encodes Novel 

Calpastatin 2 (NC2) protein of SEQ ID NO:66, comprising the amino acid sequence: 
MTVT,ErTLAVILTLLGL 
TQSERSKRDYTPSTNSLALSRSSL^ 
GPKLSQKTIVQTLGPrVQYPGS 

25 YMNSLSLFSPA. Accordingly, it will be appreciated that all characteristics and uses of the 
polypeptides of SEQ ID NO:66 described throughout the present application also pertain to the 
polypeptides encoded by the nucleic acids included in Clone 335752_157-15-4-0-Bl 1-F. In 
addition, it will be appreciated that all characteristics and uses of the polynucleotides of SEQ ID 
NO:66 described throughout the present application also pertain to the nucleic acids included in 

30 Clone 335752_157-15-4-0-Bl 1-F. A preferred embodiment of the invention is directed toward the 
compositions of SEQ ID NO:65, 66 and Clone 335752J57-15-4-0-B1 1-F. Also preferred are 
polypeptide fragments having a biological activity as described herein and the polynucleotides 
encoding the fragments. 

NC2 is a physiological inhibitor of calpains. Calpains, a group of ubiquitous Ca2+ - 

35 activated cytosolic proteases, have been implicated in cytoskeletal remodeling events, cellular 
adhesion, shape change, and mobility involving site-specific regulatory proteolysis of membrane- 
and actin-associated cytoskeletal proteins and apoptosis (Beckerle et al., Cell 51:569-577, 1987; 
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Yao et al., Am. J. Physiol. 265(pt. l):C36-46, 1993; and Shuster et al., J. Cell Biol. 128:837-848, 
1995; Squier et al., J. Cell Physiol., 178(3): 311-319, 1999). Calpains have also been implicated in 
the pathophysiology of cerebral and myocardial ischemia, platelet activation, NF-fcB activation, 
Alzheimer's disease, muscular dystrophy, cataract progression and rheumatoid arthritis. There is 
5 considerable interest in inhibitors of calpain, as cellular adhesion, cytoskeletal remodeling events 
and cell mobility are linked to numerous pathologies (Wang et al., Trends in Pharm. Sci. 15:412- 
419, 1994; Mehdi, Trends in Biochem. Sci. 16:150-153, 1991). In addition, as the 
calpain/calpastatin system is involved in membrane fusion events for several cell types, and 
calpain can be detected in human sperm and testes extracts. 

10 NC2 consists of calpastatin domain T and II. The T domain targets cytosolic localization 

and membrane association, whereas domain I of exhibits a nuclear localization function. 

The protein of SEQ ID NO:66 is a novel member of the calpastatin family and, as such, 
plays a role in cytoskeletal remodeling events, cellular adhesion, shape change, and mobility by the 
site-specific regulatory proteolysis of membrane- and actin-associated cytoskeletal proteins. 

15 Preferred polypeptides of the invention are polypeptides comprising the amino acids of SEQ ID 
NO:66 from positions 1 to 1 1 6. Also preferred are fragments of SEQ ID NO:66 having a 
biological activity as described therein and the polynucleotides encoding the fragments. 

One embodiment of the present invention relates to methods of using the protein of the 
invention or fragment thereof in assays to detect the presence of calpain in a biological sample, 

20 such as in bodily fluids, in tissue samples, or in mammalian cell cultures. As NC2 binds calpain, 
the protein of the invention can be used in assays and diagnostic kits to test the presence of calpain 
using techniques known to those skilled in the art. Preferably, a defined quantity of the protein of 
. the invention or fragment thereof is added to the sample under conditions allowing the formation 
of a complex between the protein of the invention or fragment thereof and heterologous proteins, 

25 and the presence of a complex and/or the free protein of the invention or fragment thereof is 
assayed and compared to a control. NC2 is useful as a marker of intracellular calpain activation, 
and can be used for monitoring the involvement of calpain in pathological situations (De Tullio et 
al., FEBS letter, 475(1): 17-21, 2000). Calpain has been implicated in cytoskeletal protein 
degradation involved in the pathophysiology of ischemia and disorders like Alzheimer's disease 

30 (Wronski et al., J. Neural transm., 107(2):145-157, 2000) and Parkinson's disease (Mouatt-Prigent 
et al., J. Comp. Neurol., 419:175-92, 2000), apoptosis in neural cells of rat with spinal cord injury 
(SCI) (Ray, Brain res., 867(l-2):80-9, 2000), cell fusibility (Kosower et al., Methods Mol Biol., 
144:181-94, 2000) and other physiopathologies. Assays detecting any increased or decreased 
calpain levels in a cell are thus useful in the diagnosis of any of these diseases or conditions. In 

35 addition to proteolytic activities on cytoskeletal proteins and other cellular regulatory proteins, 
calpain-NC2 systems can also affect expression levels of genes encoding structural or regulatory 
proteins. Thus, the ability to detect NC2 and calpain levels is also useful for the diagnosis of an 
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even larger number of diseases and conditions. 

In another embodiment, the polynucleotides or polypeptides of the invention may be used 
for the detection of gametes, gametic precursor cells (such as spermatogenic stem cells), or of 
specific structures within the gametes, using any technique known to those skilled in the art, 

5 including those involving the use of specific antibodies and nucleic acid probes. The ability to 
visualize spermatozoa generally, or the sperm acrosome in particular, has obvious utility for a 
number of applications, including for the analysis of infertility in patients. 

Another embodiment of the present invention relates to a method of inhibiting calpain in a 
cell, the method comprising administering to the cell an amount of the present protein sufficient to 

10 inhibit calpain in the cell. Such methods can be performed in vitro or in vivo. The inhibition of 
calpain has numerous uses in the treatment or prevention of various diseases and conditions, for 
example, the pathophysiology of cerebral, myocardial, renal ischemia, platelet activation, NF-kB 
activation, Alzheimer's disease, Parkinson's disease, muscular dystrophy, cataract progression, 
cancer cachexia and rheumatoid arthritis. Such an increase can be effected in any of a number of 

1 5 ways, including, but not limited to administering purified protein of the invention directly to the 
cells, transfecting the cells with a polynucleotide encoding the protein, operably linked to a 
promoter, and administering to a cell a compound that increases the activity or expression of the 
protein of the invention. In addition, the expression or activation of the protein of the invention 
can be inhibited in any of a large number of ways, including using antisense oligonucleotides, 

20 antibodies, dominant negative forms of the protein, and using heterologous compounds that 
decrease the expression or activation of the protein. Such compounds can be readily identified, 
e.g. by screening candidate compounds and detecting the level of expression or activity of the 
protein using any standard assay. Other calpain inhibitors are also known which can be used in 
conjunction with the present protein, or which can be used as controls in the identification of 

25 additional inhibitors or activators of calpastatin. Such inhibitors include, but are not limited to, 
cerebrolysin (Wronski et al., J. Neural Transm. Suppl., 59:263-272, 2000), E-64-D (Ray et al., 
Brain Res., 867(1-2): 80-9, 2000), and the calpain active site inhibitor N-acetyl-leucyl-leucyl- 
norleucinal (Squier et al., J. Cell Physiol., 178(3): 311-319, 1999). 

In still another embodiment, the protein of SEQ ID:66 or fragment thereof can be used to 

30 prevent cells from undergoing apoptosis. Specifically, any method of increasing the level or 
activity of the present protein in cells can be used to prevent the cells from undergoing apoptosis, 
in vitro or in vivo. For example, a polynucleotide encoding a protein of SEQ ID NO:66, or any 
fragment or derivative thereof, can be introduced into cells, e.g. in a vector, wherein the protein is 
expressed in the cells. Alternatively, a protein of SEQ ID NO:66 itself can be administered to 

35 cells, preferably in a formulation that leads to the internalization of the protein by the cells. Also, 
any compound that increases the expression or activation of the proteins within the cells can be 
administered. Preventing cells from undergoing apoptosis can be used for any of a large number of 
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purposes, including, but not limited to, to prevent the death of cells being grown in culture, to 
prevent in a patient the apoptosis associated with any of a number of disorders, or to prevent 
apoptosis in cells of a patient undergoing a treatment that increases the level of cellular stress, such 
as chemotherapy. Furthermore, the invention relates to methods and compositions using the 
5 protein of the invention or fragment thereof to diagnose, prevent and/or treat disorders 
characterized by abnormal cell proliferation and/or programmed cell death, including but not 
limited to cancer, immune deficiency syndromes (including AIDS), type I diabetes, pathogenic 
infections, cardiovascular and neurological injury, alopecia, aging, degenerative diseases such as 
Alzheimer's Disease, Parkinson's Disease, Huntington's disease, dystonia, Leber's hereditary optic 

10 neuropathy, schizophrenia, and myodegenerative disorders such as "mitochondrial encephalopathy, 
lactic acidosis, and stroke" (MELAS), and "myoclonic epilepsy ragged red fiber syndrome" 
(MERRF). For diagnostic purposes, the expression of the protein of the invention can be detected 
using any method such as Northern blotting, RT-PCR or immunoblotting methods, and compared 
. to the expression in control individuals, wherein an increase or decrease of the level of the present 

1 5 protein compared to the control level indicates the presence of the disease or condition, or of a 
propensity for the disease or condition. For prevention and/or treatment purposes of disorders in 
which cell proliferation needs to be reduced and/or apoptosis increased, the expression of protein 
of the invention may be enhanced using any method, for example administering the purified 
protein to cells, transfecting the cells with a polynucleotide encoding the protein, or administering 

20 to the cells a compound that increases the expression or activity of the protein. For prevention 
and/or treatment purposes of disorders in which cell proliferation needs to be enhanced and/or 
apoptosis reduced, inhibition of endogenous expression of the protein of the invention may be 
achieved using any method , including triple helix and antisense strategies. 

In another embodiment, inhibiting the proteins of the invention can be used to induce 

25 apoptosis in undesired cells. Such inhibition can be accomplished in any of a number of ways, 
including, but not limited to, using antibodies, antisense sequences, dominant negative forms of the 
protein, or small molecule inhibitors of the expression or activity of the proteins. Such induction 
of apoptosis can be used to eliminate any undesired cells, for example cancer cells, in a patient. 
Preferably, such inhibitors are targeted specifically to the undesired cells in the patient using 

30 standard methods. 

In another preferred embodiment, the protein of the invention can be used to modulate 
and/or characterize fertility, including for the treatment or diagnosis of infertility, and for 
. contraception. As NC2 is involved in the acrosomal reaction which is a required step in 
fertilization, over- or under-expression or activation of the present protein can be used to disrupt 

35 this reaction and thereby inhibit fertility. For example, for contraception, the expression or 
activation of the protein can be artificially disrupted, for example by increasing the protein level 
using polynucleotides encoding the protein, using the protein itself, or using activators of protein 
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expression or activity, or by decreasing the protein level using inhibitors such as antisense 
oligonucleotides, antibodies, dominant negative forms of the protein, and using heterologous 
compounds that inhibit protein expression or activity. Similarly, the cause of infertility in many 
patients can be detected by detecting the level of expression of the present protein, where an 
5 abnormal level of activity or expression of the protein indicates that a cause of infertility involves 
the calpain-dependent acrosomal reaction. Such a diagnosis would also point to methods of 
treating the infertility, e.g. by increasing or decreasing the expression or activation of the present 
protein in spermatozoa. 

In another embodiment, the invention relates to methods and compositions using the 

1 0 protein of the invention or fragment thereof as a marker protein to selectively identify tissues, 
preferably testis, or to distinguish between two or more possible sources of a tissue sample on the 
basis of the level of the protein of SEQ ED NO:66 in the sample. For example, the protein of SEQ 
ID NO:66 or fragments thereof may be used to generate antibodies using any techniques known to 
those skilled in the art, including those described therein. Such tissue-specific antibodies may then 

15 be used to identify tissues of unknown origin, for example, forensic samples, differentiated tumor 
tissue that has metastasized to foreign bodily sites, or to differentiate different tissue types in a 
tissue cross-section using immunochemistry. In such methods a tissue sample is contacted with the 
antibody, which may be detectably labeled, under conditions which facilitate antibody binding. 
The level of antibody binding to the test sample is measured and compared to the level of binding 

20 , to control cells from testis or tissues other than testis to determine whether the test sample is from 
testis. Similar methods can be used to specifically detect cells expressing the protein, as well as to 
specifically isolate cells expressing the protein or to isolate the protein itself. For example, an 
antibody against the protein of SEQ ID NO:66 or a fragment thereof may be fixed to a solid 
. support, such as a chromatography matrix. A preparation containing cells expressing the protein of 

25 SEQ ID NO:66 is placed in contact with the antibody under conditions which facilitate binding to 
the antibody. The support is washed and then the protein is released from the support by 
contacting the support with agents which cause the protein to dissociate from the antibody. 

Alternatively, the level of the protein of SEQ ID NO:66 in a test sample may be measured 
by determining the level of RNA encoding the protein of SEQ ID NO:66 in the test sample. RNA 

30 levels may be measured using nucleic acid anays or using techniques such as in situ hybridization, 
Northern blots, dot blots or other techniques familiar to those skilled in the art. If desired, an 
amplification reaction, such as a PCR reaction, may be performed on the nucleic acid sample prior 
to analysis. The level of RNA in the test sample is compared to RNA levels in control cells from 
testis or tissues other than testis to determine whether the test sample is from testis. For a number 

35 of disorders listed above, particularly of inflammatory processes, expression of the genes encoding 
the polypeptide of SEQ ID NO:66 at significant higher or lower levels may be routinely detected in 
certain tissues or cell types (e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum, 
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plasma, synovial fluid, and spinal fluid) or another tissue of cell sample taken from an individual 
having such a disorder, relative to the standard gene expression level, i.e., the expression level in 
healthy tissue or bodily fluid from an individual not having the disorder. 

In another embodiment, the invention relates to methods for using the protein of the 
5 invention or fragments to identify autoantibodies which indicate inflammatory processes and 
particularly, rheumatoid arthritis (RA), a systemic disease characterized by chronic polyarthritis 
and joint destruction, and in which high levels of autoantibodies directed against calpastatin have 
been identified. Accordingly, the present protein may be used to detect the presence and/or the 
localization of autoantibodies in a cell. In a typical embodiment, the protein of SEQ ID NO:66 is 
10 labeled with any detectable moiety including, but are not limited to, a fluorescent label, a 
radioactive atom, a paramagnetic ion, biotin, a chemiluminescent label or a label which can be 
detected through a secondary enzymatic or binding step. The invention further provides a method 
of diagnosing inflammatory processes, e.g. rheumatoid arthritis, and distinguishing such processes 
from other diseases. 

15 Protein of SEQ ID NO:68 (internal designation Clone 646607_181-1 5-2-0-E2-F) 

The cDNA of Clone 646607_181-15-2-0-E2-F (SEQ ID NO:67) encodes Benzodiazepine 
Receptor 2 (BZRP-R2) protein of SEQ ID NO:68, comprising the amino acid sequence: 
MRLQGAffVLLPHLGPILVWLFTR^ 
ASYLVWK3)LGGGLGWPLAL^ 

20 STALIWHPINKLAALLLIJPYLAWLTVTSA^ 

Accordingly, it will be appreciated that all characteristics and uses of the polypeptides of SEQ ID 
NO:68 described throughout the present application also pertain to the polypeptides encoded by the 
nucleic acids included in Clone 646607_181-15-2-0-E2-F. In addition, it will be appreciated that 
all characteristics and uses of the polynucleotides of SEQ ID NO:68 described throughout the 
. 25 present application also pertain to the nucleic acids included in Clone 646607_1 8 1 -1 5-2-0-E2-F. 
A preferred embodiment of the invention is directed toward the compositions of SEQ ID NO:67, 
-68 and Clone 646607_181-15-2-0-E2-F. Also preferred are polypeptide fragments having a 
biological activity as described herein and the polynucleotides encoding the fragments. 

BZRP-R2 is homologous to peripheral benzodiazepine receptor/isoquinoline binding 

30 protein (PBR/DBP) of human, bovine and murine origin (Genbank accession numbers M36035, 
M64520 and L17306 respectively). The 170-amino-acid protein of SEQ ID NO: 68 is similar in 
size and hydropathicity to known peripheral PBR/IBP benzodiazepine receptors/isoquinoline 
binding proteins. BZRP-R2 has five transmembrane domains at positions 3-23, 45-65, 82-102, 
105-125 and 130-150. Moreover, BZRP-R2 displays a stretch of 1 1 amino acids (starting with 

35 V144 and ending with Rl 54) that corresponds to a recently identified putative cholesterol 

recognition/interaction amino acid consensus pattern (-I7V-(X)(l-5)-Y-pQ(l-5)-R/K-) [See Li et 
al, Endocrinology 1998 Dec; 139 (12): 4991-7]. 
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BZRP-R2 is capable of binding benzodiazepine and imidazopyridine derivatives, but is 
distinct from the GABA neurotransmitter receptor. BZRP-R2 polypeptides are most abundant in 
steroidogenic cells and are found primarily on outer mitochondrial membranes. BZRP-R2 is 
associated with a 34-fcDa pore-forming, voltage-dependent anion channel protein located on the 
5 outer/inner mitochondrial membrane contact sites. Ligands of BZRP-R2, upon binding to the 
receptor, simulate steroid synthesis in steroidogenic cells in vitro and in vivo. BZRP-R2 stimulates 
steroid formation by increasing the rate of cholesterol transfer from the outer to the inner 
mitochondrial membrane. 

In addition to its role in mediating cholesterol movement across membranes, BZRP-R2 has 

10 been implicated in several other physiological functions, including cell growth and differentiation, 
chemotaxis, mitochondrial physiology, porphyrin and heme biosynthesis, immune response, and 
anion transport. In addition, BZRP-R2 agonists are potent anti-apoptotic compounds. 

BZRP-R2 is associated with stress and anxiety disorders. BZRP-R2 plays a role in the 
regulation of several stress systems such as the HPA axis, the sympathetic nervous system, the 

15 renin-angiotensin axis, and the neuroendocrine axis. In these systems, acute stress typically leads 
to increases in BZRP-R2 density, whereas chronic stress typically leads to decreases in BZRP-R2 
density. For example, in Generalized Anxiety Disorder (GAD), Panic Disorder (PD), Generalized 
Social Phobia (GSP), and Post-Traumatic Stress Disorders (PTSD), BZRP-R2 density is typically 
decreased. BZRP-R2 is expressed glial cells in the brain. Furthermore, BZRP-R2 expression is 

20 increased in neurodegenerative disorders and after neurotoxic and traumatic-ischemic brain 
damage. BZRP-R2 expression is decreased in chronic schizophrenics, suggesting that the 
decreased density of BZRP-R2 in the brain may be involved in the pathophysiology of 
schizophrenia. However, BZRP-R2 is higher than normal in autopsied brain tissue from PSE 
patients (Portal-Systemic Encephalopathy patients). 

25 BZRP-R2 increases mitochondrial activity and prevents apoptosis and is therefore 

implicated tumor cell proliferation. BZRP-R2 is preferentially expressed in liver and breast 
cancers. Further, BZRP-R2 is useful as a tool/marker for detection, diagnosis, prognosis and 
treatment of cancer. 

Many ligands have been described that bind to BZRP-R2 with various affinities. Some 
30 benzodiazepines, Ro 5-4864 [4-chlorodiazepam], diazepam and structurally related compounds, 
are potent and selective PBR ligands. Exogenous ligands also include 2-phenylquinoline 
carboxamides (PK11195 series), imidazo [l,2-a]pyridine-3-acetamides (Alpidem series), 
pyridazine, and isoquinilone derivatives. Some endogenous compounds, including porphyrins and 
diazepam binding inhibitor (DBI), bind to BZRP-R2. 
35 In one embodiment, a preferred polypeptide of the invention comprises the amino acids of 

SEQ ID NO: 68 from position 144 to 154: VTSALTYHLWR. Further preferred fragments of 
BZRP-R2 comprise the epitope: ALPLRLYAV or fragments thereof. In another embodiment, the 
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subject invention provides a polypeptide comprising the sequence of SEQ ID NO: 68. Other 
preferred polypeptides of the invention include biologically active fragments of SEQ ID NO: 68. 
Biologically active fragments of the protein of BZRP-R2 have any of the biological activities 
described herein. In another embodiment, the polypeptide of the invention is encoded by clone 
5 646607 215-15-5-0-B11-F. 

A preferred embodiment of the invention is a method of screening for compounds that 
modulate the expression of BZRP-R2. This method comprises the steps of i) contacting a cell with 
a test compound and ii) comparing the level of BZRP-R2 polypeptides in a cell after exposure to 
the test compound to that of an untreated control cell. The level of BZRP-R2 polypeptides may be 

10 inferred by detecting mRNA for BZRP-R2 by methods common to the art such as Northern 

blotting or RT-PCR. The level of BZRP-R2 polypeptides may also be detected by antibody-based 
methods common to the art such as Western blotting or immunofluorescence. Test compounds 
that increase BZRP-R2 expression are useful as agonists, as discussed herein. Test compounds 
that decrease BZRP-R2 expression are useful as antagonists, as discussed herein. 

15 Antagonists of BZRP-R2 include agents which decrease the levels of expressed mRNA 

encoding the protein of SEQ ID NO: 68. These include, but are not limited to, RNAi, one or more 
ribozymes capable of digesting the protein of the invention, or antisense oligonucleotides capable 
of hybridizing to mRNA encoding BZRP-R2. Antisense oligonucleotides can be administrated as 
DNA, RNA, as DNA entrapped in proteoliposomes containing viral envelope receptor proteins 

20 [Kanoda, Y. et al. (1 989) Science 243 : 375, which disclosure is hereby incorporated by reference 
in its entirety] or as part of a vector which can be expressed in the target cell to provide antisense 
DNA or RNA. Vectors which are expressed in particular cell types are known in the art. 
Alternatively, the DNA can be injected along with a carrier. A carrier can be a protein such as a 
cytokine, for example interleukin 2, or polylysine-glycoprotein carriers. Carrier proteins, vectors, 

25 and methods of making and using polylysine carrier systems are known in the ait Alternatively, 
nucleic acid encoding antisense molecules may be coated onto gold beads and introduced into the 
skin with, for example, a gene gun [Ulmer, J.B. et al. (1993) Science 259: 1745, which disclosure is 
hereby incorporated by reference in its entirety]. 

A preferred embodiment of the invention is a method of screening for compounds that bind 

30 to BZRP-R2 polypeptides. Such compounds are useful for developing agonists and antagonists of 
BZRP-R2 activity. This method comprises the steps of: i) contacting a BZRP-R2 polypeptide or 
fragment thereof with a test compound under conditions that allow binding to occur and ii) 
detecting binding of said test compound. Binding may be detected by any method common to the 
art such as competition with a labeled antibody specific for BZRP-R2 or by direct labeling of each 

35 test substance. In one example of such a method, a polynucleotide encoding a BZRP-R2 
polypeptide or a biologically active fragment thereof is transformed into a eukaryotic or 
prokaryotic host cell. The transformed cells may be viable or fixed. Drugs or compounds which 
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are candidates for binding BZRP-R2 polypeptides are screened against such transformed cells in 
binding assays well known to those skilled in the art. Alternatively, assays such as those taught in 
Geysen H. N., WO Application 84/03564, published on Sep. 13, 1984, and incorporated herein by 
reference in its entirety, may be used to screen for peptide compounds which demonstrate binding 
5 affinity for BZRP-R2 polypeptides or fragments thereof. In another embodiment, competitive drug 
screening assays using neutralizing antibodies specifically compete with a test compound for 
binding to BZRP-R2 polypeptides or fragments thereof. Preferred test compounds are those 
included in the benzodiazepine class, such as diazepam (i.e., Valium), triazolobenzodiazepine, and 
adinazolam, as well as modified versions thereof. Further preferred test compounds are in the 

1 0 imidazo pyridine and isoquinilone classes. 

A variety of drug screening techniques may be employed. In this aspect of the invention, 
BZRP-R2 polypeptide or biologically active fragments thereof, may be free in solution, affixed to 
a solid support, recombinantly expressed on or chemically attached to a cell surface, or located 
intracellularly. The formation of binding complexes between BZRP-R2 polypepetides or 

. 1 5 biologically active fragments thereof, and the compound being tested, may then be measured as 
described. 

Another embodiment of the subject invention provides compositions and methods of 
selectively modulating the activity of the protein of the invention. Modulation of BZRP-R2 allows 
for the successful prevention, treatment, or management of disorders or biochemical abnormalities 

20 associated with BZRP-R2. Agonist compounds are those that increase the amount of BZRP-R2 
. polypeptides in a cell or increase the biological activity of BZRP-R2. A preferred embodiment of 
the invention is a method of screening for agonists that bind to BZRP-R2 comprising the steps of: 
i) screening for test substances that bind to BZRP-R2, as described above and ii) detecting BZRP- 
R2 biological activity. Preferably, this method is accomplished in an intact cell. Further 

25 preferably, the cell is a steroidogenic cell such as a testicular or ovarian cell. Preferably, the 

biological activity of BZRP-R2 is determined by measuring the concentration of steroid hormones 
released from the cell before and after exposure to the test substance. Agonists of BZRP-R2 will 
increase the release of steroid hormones from the cell. Antagonist compounds are those that 
decrease the amount of BZRP-R2 polypeptides in a cell or decrease the biological activity of 

30 BZRP-R2. Another preferred embodiment of the invention is a method of screening for antagonists 
that bind to BZRP-R2 comprising the steps of: i) screening for test substances that bind'toBZRP- 
R2, as described above and ii) detecting BZRP-R2 biological activity. Preferably, this method is 
accomplished in an intact cell. Further preferably, the cell is a steroidogenic cell such as a 
testicular or ovarian cell. Preferably, the biological activity of BZRP-R2 is determined by 

35 measuring the concentration of steroid hormones released from the cell before and after exposure 
to the test substance. Antagonists of BZRP-R2 will decrease the release of steroid hormones from 
the cell. 
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Antagonists, able to reduce or inhibit the expression or the activity of the protein of the 
invention, are useful in the treatment of diseases associated with elevated levels of BZRP-R2, 
increased cell proliferation or reduced apoptosis, and increased cholesterol transport Thus, the 
subject invention provides methods for treating a variety of diseases or disorders, including but not 
5 limited to cancers, especially liver and breast cancer, and portal-systemic encephalopathy. 
Increased cholesterol transport into the mitochondria of steroidogenic cells results in higher than 
normal production of steroid hormones such as progesterone, testosterone, and estrogen. 
Abnormally high levels of steroid hormones lead to disruption of adrenocortical feedback 
mechanisms and underproduction of trophic hormones from the hypothalamus and pituitary. 

1 0 Inhibition of BZRP-R2 and steroidogenesis may increase levels of trophic hormones such as 
gonadotropin-releasing hormone. 

Alternatively, the subject invention provides a method of treating diseases or disorders 
associated with decreased levels of BZRP-R2 polypeptides and decreased steroid hormone release 
with an agonist thereof. Such method comprises the step of contacting a cell with a BZRP-R2 

15 agonist. This method comprises the step of contacting a cell with an agonist of BZRP-R2. Thus, 
the subject invention provides methods of treating disorders including, but not limited to, 
schizophrenia, chronic stress, GAD, PD, GSP and PTSD. Other disorders which may be treated by 
agonists of BZRP-R2 include those associated with decreases in cell proliferation, e.g. 
developmental retardation. Furthermore, because BZRP-R2 is able to transport cholesterol into 

20 cells, BZRP-R2 agonists may also be used to increase cholesterol transport into cells. Diseases 
associated with cholesterol transport deficiencies include lipoidal adrenal hyperplasia, ovarian 
cysts, abnormal lipid deposits in steroidogenic cells. Disorders that reflect a requirement for 
cholesterol for myelin and myelination, include Alzheimer's disease, multiple sclerosis, spinal cord 
injury, and brain development neuropathy. The methods of treating disorders associated with 

25 . decreased levels of BZRP-R2 may be practiced by introducing agonists which stimulate the 
expression or the activity of BZRP-R2. 

Additionally, disorders resulting from defective mitochondrial activity may be treated with 
an agonist to BZRP-R2. Defective mitochondrial activity may alternatively or additionally result 
• in the generation of highly reactive free radicals that have the potential of damaging cells and 

30 tissues. These free radicals may include reactive oxygen species (ROS) such as superoxide, 

peroxynitrite and hydroxyl radicals, and other reactive species that may be toxic to cells and cause 
. apoptosis. For example, oxygen free radical induced lipid peroxidation is a well-established 

pathogenic mechanism in central nervous system (CNS) injury such as that found in a number of 
. degenerative diseases, and in ischemia (i.e., stroke). Diseases associated with altered mitochondrial 

35 function and apoptosis include: Alzheimer's Disease, diabetes mellitus, Parkinson's Disease, 
Huntington's disease, dystonia, Leber's hereditary optic neuropathy, schizophrenia, mitochondrial 
encephalopathy, lactic acidosis, and stroke. 
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A further preferred embodiment includes a method of inhibiting apoptosis of cells in 
culture. This method comprises the step of contacting a cell in culture with an agonist to BZRP- 
R2. Such methods are useful for culturing cells that are notoriously undergo apoptosis, such as 
primary neurons and lymphocytes. 
5 In one embodiment, the level of BZRP-R2 in a cell may be increased by introducing 

nucleic acids encoding a BZRP-R2 polypeptide or biologically active fragment thereof into a 
targeted cell type. Vectors useful in such methods are known to those skilled in the art, as are 
methods of introducing such nucleic acids into target tissues. 

Antibodies or other polypeptides capable of reducing or inhibiting the activity of BZRP-R2 

10 may be provided as in isolated and substantially purified form. Alternatively, antibodies or other 
polypeptides capable of inhibiting or reducing the activity of BZRP-R2 may be recombinantly 
expressed in the target cell to provide a modulating effect. In addition, compounds which inhibit 
or reduce the activity of BZRP-R2 may be incorporated into biodegradable polymers being 
implanted in the vicinity of where drug delivery is desired. For example, biodegradable polymers 

1 5 may be implanted at the site of a tumor or, alternatively, biodegradable polymers containing 
antagonists/agonists may be implanted to slowly release the compounds systemically. 
Biodegradable polymers, and their use, are known to those of skill in the art (see, for example, 
Brem et al. (1991) J. Neurosurg. 74:441-446, which disclosure is hereby incorporated by reference 
m its entirety). 

20 In another embodiment, the invention provides methods and compositions for detecting the 

level of expression of the mRNA encoding the protein of the invention. Quantification of mRNA 
levels of BZRP-R2 may be useful for the diagnosis or prognosis of diseases associated with an 
altered expression of the protein of the invention. Assays for the detection and quantification of the 
mRNA encoding BZRP-R2 are well known in the art (see, for example, Maniatis, Fitsch and 

25 Sambrook, Molecular Cloning; A Laboratory Manual (1982), or Current Protocols in Molecular 
Biology, Ausubel, F.M. et al. (Eds), Wiley & Sons, Inc., disclosures of which are hereby 
incorporated by reference in their entireties). 

Polynucleotides probes or primers for the detection of BZRP-R2 mRNA can be designed 
from the cDNA of SEQ ID NO: 67. Methods for designing probes and primers are known in the 

30 art. In another embodiment, the subject invention provides diagnostic kits for the detection of the 
mRNA of the protein of the invention in cells. The kit comprises a package having one or more 
containers of oligonucleotide primers for detection of the protein of the invention in PCR assays or 
one or more containers of polynucleotide probes for the detection of the mRNA of the protein of 
the invention by in situ hybridization or Northern analysis. Kits may, optionally, include 

35 containers of various reagents used in various hybridization assays. The kit may also, optionally, 
contain one or more of the following items: polymerization enzymes, buffers, instructions, 
controls, or detection labels. Kits may also, optionally, include containers of reagents mixed 



240 



WO 02/094864 



PCT7IB01/01715 



together in suitable proportions for performing the hybridization assay methods in accordance with 
the invention. Reagent containers preferably contain reagents in unit quantities that obviate 
measuring steps when performing the subject methods. 

In another embodiment, the invention relates to methods and compositions for detecting 

5 and quantifying the level of the protein of the invention present in a particular biological sample. 
These methods are useful for the diagnosis or prognosis of diseases associated with altered levels 
of the protein of the invention. Diagnostic assays to detect the protein of the invention may 
comprise a biopsy, in situ assay of cells from organ or tissue sections, or an aspirate of cells from a 
tumor or normal tissue. In addition, assays may be conducted upon cellular extracts from organs, 

10 tissues, cells, urine, or serum or blood or any other body fluid or extract. 

Assays for the quantification of BZRP-R2 polypeptides may be performed according to 
methods well known in the art. Typically, these assays comprise the steps of: contacting the 
sample with a ligand of the protein of the invention or an antibody (polyclonal or monoclonal) that 
specifically recognizes the protein of the invention or a fragment thereof and detecting the complex 

15 formed between the protein of the invention present in the sample and the ligand or antibody. 
Fragments of the ligands and antibodies may also be used in the binding assays, provided these 
fragments are capable of specifically interacting with BZRP-R2 polypeptides. Further, ligands and 
antibodies which bind to BZRP-R2 may be labeled according to methods known in the art. Labels 
which are useful in the subject invention include, but are not limited to, enzymes labels, 

20 radioisotopic labels, paramagnetic labels, and chemiluminescent labels. Typical techniques are 
described by Kennedy, J. H., et al. (1976) Clin. Chim. Acta 70:1-31; and Schurs, A. H. et al. 
(1 977) Clin. Chim. Acta 81:1 -40, disclosures of which are hereby incorporated by reference in 
their entireties. 

The subject invention also provides methods and compositions for the identification of 
25 metastatic tumor masses. In this aspect of the invention, the polypeptide or antibody that 

specifically binds a BZRP-R2 polypeptide or fragment thereof may be used as a marker for the 

identification of the metastatic tumor mass. Metastatic tumors which originated from the breast or 

liver may overexpress BZRP-R2 polypeptides, whereas newly forming tumors, or those originating 

from other tissues are not expected to bear BZRP-R2. 
30 Protein of SEQ ID NO:70 (Internal designation Clone 229654 J14-049-1-0-F12-F (cFS)) 

The cDNA of Clone 229654J 14-049-1-0-F12-F (SEQ ID NO:69) encodes the 787 amino 

acid long polypeptide called LAP of SEQ ID NO:70 comprising the amino acid sequence : 

MFRLWLLLAGLCGLLASRPGFQNSLLQIVIPEKIQTNTN^ 

KQRYFLTDNFMTxTW 

35 vsygieplesavefqhvlhkl^ 

emhivvdktlydywgsdsmwtnkvieivglans 
eadellqkflewkqsylnlrphdiaylliymdyprylgawpg™ 
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ITLEAFAVIVTQMLALSLGISY 
NVGVKCLQNKPQMQKKSPKPVCGNGRLEGNEIOT 
CYKGLCCKDCQILQSGVECI^KAHPECDIAENCNGSSPECGP 
DCHDLDARCESWGKGSRNAPFACYEEIQSQSDRFGNCGRDRN^ 
5 VCTYPTRKPFHQENGDVI^^ 

CVESRIIKASAHVCSQQCSGHGVCDSR^ 
ASGKTENITVTLGFLIALPILIV^ 

EGSTQTYAGQTRSESSSQADTSKSKSEDSAEAYTSRSKSQDSTQTQSSSN. Accordingly, it 
will be appreciated that all characteristics and uses of polypeptides of SEQ ID NO:70 described 

10 throughout the present application also pertain to the polypeptides encoded by the nucleic acids 
included in Clone 229654_1 14-049-1-0-F12-F. In addition, it will be appreciated that all 
characteristics and uses of the polynucleotides of SEQ ID NO:69 described throughout the present 
application also pertain to the nucleic acids included in Clone 229654_1 14-049-1-0-F12-F. A 
preferred embodiment of the invention is directed toward the compositions of SEQ ID NO:69, 

15 SEQ ID NO:70, and Clone 229654_1 14-049-1-0-F12-F. Also preferred are polypeptide fragments 
having a biological activity as described herein and the polynucleotides encoding the fragments. 

LAP, the protein of SEQ ID NO:70, is a new member of the ADAM (A Disintegrin And 
Metalloprotease domain) family of proteins. The gene for Clone 229654.cFS is located on 
chromosome 8 and is expressed in tissues including liver, adipose and testis. 

20 LAP, as an ADAM family member is a membrane-anchored cell surface protein. The 

members of this family form a large group of cell surface adhesion molecules and proteases whose 
name describes the two domains that these proteins share with their closest relatives, the PEH class 
of snake venom metalloproteinases (SVMPs). The ADAM proteases fall with the SVMPS within 
the adamalysin/reprolysin subfamily of Zinc-dependent metalloproteinases. ADAMs have also 

25 been refeiTed to as MDCs (metaUoproteinase/disintegrin/cysteine-rich), cellular disintegrins, and 
. metalloproteinase-desintegrins. 

These proteins have been isolated from a wide range of organisms ranging from yeast, 
worm, flies, frogs and mammals. Expression studies have shown that while some ADAMS have 
wide tissue expression, some have their expression restricted to one tissue. 

30 The ADAM proteins have been shown to function in cell-cell interaction, cell-signaling, 

and in the processing of the ectodomains of membrane-anchored proteins and have been implicated 
in diverse biological processes, including sperm-egg binding and fusion, myoblast fusion, protein- 
ectodomain shedding of cytokines, cytokine receptors, adhesions and other extracellular protein 
domains. Furthermore, they have been shown to be necessary for proper axonal guidance, neural 

35 and wing development in Drosophila, vulval development in Caenorhabditis elegans, and epithelial 
maturation and skin and hair development in the mouse. 

Structurally, LAP has an N-terminal signal sequence (MFRLWLLLAGLCGLLAS), a 
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prodomain 

(HIXQRYFLTONFMIYU 

FENVSYGIEPLESAVEFQHVLHKLKNEDND that has been 

shown to maintain the enzyme in an inactive state, followed by a rnetalloprotease domain 
5 (LYLEMHIVVDKTLYDYWGSDSMIVTNKVIEIV 
STVGEADELLQKFLEWKQSYLNLRPHD 
YPKEITLEAFAVIVTQMLALSLGISYDDPK^ 

QNFISNVGVKCLQNKP) that is important for proteolysis and contains the zinc-binding catalytic 
site, a disintegrin-like domain 
10 (OVCGNGRLEGNEICDCGTEAQCGP^^ 

CPKAHPECDIAENCNGSSPEC) that has been demonstrated to bind integrins, a cystein-rich 
region 

(GLSCKNNKFICYDGDCHDLDARCTSWGKGSRNAPFAC^ 

YWCGWRNLICGRLVCTYPTRKPraQENGD\WA that also have adhesion 

1 5 activity, an EGF-like domain 

(CDIGRVCVNREC\TCSRIIKASAE^ important 
for substrate recognition, a transmembrane domain (TWLLGFLIAIJPILIVTTAIVL) and a 
cytoplasmic tail 

(ARKQXJCNTWFAKEEEFPSSESKSEGST^ 
20 EDSAEAYTSRSKSQDSTQTQSSSN) that has been shown in many ADAMs to contain SH3 
binding sites and which might be important for cell signaling. 

Interestingly, the cytoplasmic C-terminal domain of the LAP protein does not contains any 
SH3 binding sites but it ends by a 69 amino acid region rich in serine/threonine residues (36% of 
serine residues; 

25 SSESKSEGSTQTYASQSSSEGSTQTYAGQTRSESSSQADTSKSKSEDSAEAYTSRSKSQDST 
QTQSSS). 

LAP contains both a disintegrin-like and a rnetalloprotease domain, and has both cell 
adhesion and protease activities. However, LAP lacks the catalytic site consensus sequence in its 
rnetalloprotease domain (QMLALSLGISYD). LAP, like fertilin beta another catalytically inactive 
30 protease, is processed on the sperm cell surface during sperm maturation in the epididymis 
yielding mature protein that retains dismtegrin domain on fertilization-competent sperm. 

Preferred LAP polypeptides for uses in the methods described below include the 
polypeptides comprising the amino sequence of: 

kpvcgngrlegneiox:gteaqcgp 
35 crpkahpecdiaeno^gss 

rnapfacyeeiqsqsdrfgncgrdr^ 
yafvrdsvciivdykijrtwdplavkngsqcdigrv 
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GHGVCDSRNKCTCSPGYKPPN^ 

LIVTTAIVLARKQLKNWFAKEEEFPSSESK^ 

ADTSKSKSEDSAEAYTSRSKSQDSTQTQSSSN; 

A polypeptide comprising the amino acid sequence of: 
5 KPVCGNGRLEGNEICDCGTEA 

CRPKAHPECDIAENC^GSSPECGPDITLINGLSCKNNm 
RNAPFACYEEIQSQSDRFGNCGRDRNNKYWCGWK^ 
YAFVRDSVCITVDYKIPRTWDPLAVKNGSQ^ 

GHGVCDSRNKCHCSPGYKPPNCQIRSKGFSIFPEEDMGSIMERASGKTEN 
1 0 A polypeptide comprising the amino acid sequence of: 

KPVCGNGRLEGNEICDCGTEAQCGPASCCDFRTCVLKDGAKCYKGLCt^CQILQSGVE 
CRPKAHPECDIAENC^ 

RNAPFACYEEIQSQSDRFGNCGRDRNNKYWCGWI^LICG 
YAFVRDSVC 

15 A polypeptide comprising the amino acid sequence of: 

KPVCGNGRLEGNEICDCGTEAQ 
. CRPKAHPECDIAENCNGSSPECGPD 

A polypeptide comprising the amino acid sequence of: 
GI^CKNNKJICTO 
20 WCGWRNLICGRLVCTYPTRKPFHQENGDVIY^ 

A polypeptide comprising the amino acid sequence of: 
PSSESKSEGSTQTYASQSSSEGSTQTYAGQTRSESSSQADTSKSKSEDSAEAYTSRSKSQDS 
TQTQSSSN 

An embodiment of the invention is directed to a method to screen for molecules which 
25 block the interaction of LAP with the cell-surface receptors on the oocyte surface comprising the 
steps of contacting sperm with said molecule to be screen, contacting the sperm with the oocyte, 
and disrupting sperm-oocyte binding. 

A preferred embodiment of the invention is directed to a method of inhibiting sperm- 
oocyte interaction by blocking the LAP interaction with the oocyte cell surface comprising the 
30 steps of contacting sperm with a blocking molecule, as identified in a screen, which inhibits or 
blocks the LAP-oocyte interaction. Preferred agents include antibodies directed to LAP- 
disintegrin domain. 

LAP is a plasma membrane-anchored protein having adhesion and cell signaling activities 
in liver cells as well as in adipocytes. More specifically, it is believed that the mature LAP protein 
35 interacts, via its extracellular domain, with as yet unidentified integrins and other proteins present 
at the surface of neighbouring cells, while its cytoplasmic serine-rich domain is involved in 
signaling events by interacting with cytoplasmic or plasma membrane-associated proteins that 
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interact with serine-rich domains. More over, as serine and threonine residues are both 
phosphoiylatable residues, the signaling activity of the LAP protein is regulated by 
phosphorylation/dephosphorylation events of specific serine and or threonine residue(s) present on 
this domain. 

5 In a further embodiment, polyclonal or monoclonal antibodies directed against 

polypeptides of the invention are used in methods to reduce or inhibit cell-cell interactions between 
cells in vitro, preferably liver or adipose cells. A preferred method of reducing or inhibiting cell- 
cell interactions comprises the steps: i) contacting the cells with a composition comprising an 
inhibitory-effective amount of an antibody directed against polypeptides of the invention, 

10 preferably a monoclonal antibody directed against the disintegrin-like domain or a monoclonal 
antibody directed against the cysteine-rich domain. 

A further embodiment is directed to a method of blocking or inhibiting the interaction of 
LAP with at least one of its binding-partners comprising the steps: i) contacting cells with a 
blocking-effective amount of a polypeptide fragment of the invention comprising an extracellular 

15 domain of LAP. Preferred extracellular domains to be used in said methods of blocking the 
interaction of LAP and a binding partner include the disintegrin-like domain of LAP and the 
cysteine-rich domain of LAP. Preferred synthetic peptides to be used in compositions of said 
methods have amino acid sequences comprising CRPKAHPECDIAENC or 
CGNGRLEGNEICDCG, or a combination thereof. 

20 Protein of SEQ ID:72 (Internal designation Clone 338116 _174-l-l-0-B10-F) 

The cDNA of Clone 338116J74-1-1-0-B10-F (SEQ ID NO:71) encodes the protein of 
SEQ ID NO:72, herein referred as Short Histone Deacetylase (SHDAC), comprising the amino 
acid sequence: 

MGPHLHLCLCWDLRSLRVCVSLWSVH^ 

25 SGIAATPASAAAATLDVAVRRGLSHA^ 

AAALSMFHVSTPLJVMTGGFLSCILGLVLPIAYGFQPDLVLVALG 
MLRGLAGGRVLALLEENSTPQLAGn.ARVLN^ 
- PQWKMLQCHPHLVA, is encoded by the cDNA clone 3381 16,174-1 -1-0-B10-F (SEQ ID:71). 
The protein of SEQ ID NO:72 is a novel variant of histone deacetylase (HDAC). Accordingly, it 

30 : will be appreciated that all characteristics and uses of the polypeptide of SEQ ID NO:72 described 
throughout the present application also pertain to the polypeptide encoded by a nucleic acid 
included in clone 338116_174-l-l-0-B10-F. In addition, it will be appreciated that all 
characteristics and uses of the nucleic acid of SEQ ID NO:71 described throughout the present 
application also pertain to the nucleic acid included in clone 3381 16_174-l-l-0-B10-F. A 

35 preferred embodiment of the invention is directed toward the compositions of SEQ ID NO:71, 
SEQ ID NO:72, and Clone 3381 16_1 74-1-1 -0-B10-F. Also preferred are polypeptide fragments 
having a biological activity as described herein and the polynucleotides encoding the fragments. 
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The protein of SEQ ID:72 contains one potential transmembrane segment (position 130 to 
150), and a signal peptide (position 1: MGPHLHLCLCWDLRSL). The protein of SEQ ID:72 is 
highly expressed in placenta and salivary glands. 

Histone deacetylase (HDAC) proteins comprise a family of related proteins that act in 
5 conjunction with histone acetyl-transferase proteins to modulate chromatin structure and 
transcriptional activity via changes in the acetylation status of histones. HDACs remove acetyl 
groups from histones by hydrolysis [Davie, J.R. Cuir. Opin. Genet. Dev. 8, 173-178 (1998)], 
thereby causing local chromatin condensation and decreasing the accessibility of particular DNA 
regions for RNA polymerase complexes. In fact, transcriptionally active chromatin correlates with 

10 histone hyperacetylation [Grundstein M. Nature 389:349-352 (1997)], and it has been suggested 
that histone acetyltransferases promote transcription while histone deacetylases act as repressors 
and transcriptional silencer [Doetzlhofer A. et al., Mol. Cell. Biol. 19:5504-551 1(1999)]. 

Histone deacetylase proteins belong to a superfamily of zinc metalloenzymes with a 
conserved 380 residue catalytic domain [Finnin, M.S. et al., Nature 401:188-193(1999)]. Histone 

1 5 deacetylases are found in high-molecular-weight complexes associated with adapter proteins like 
SIN3, RbAp46/48, SAP 1 8, S AP30, and nuclear corepressors like N-CoR, SMRT, and SUN-CoR 
[Alland, L. et al., Nature 387:49-55 (1997); Heinzel, T. et al., Nature 387:43-48 (1997); Laherty, 
CD. et al., Cell 89: 349-356 (1997); Nagy, L.H. et al., Cell 89:373-380 (1997); Zhang, W. et al., 
EMBO J. 17:3155-3167 (1997); Zhang; Y. et al., Cell 89:357-364 (1997); Knoepfker, P.S. & 

20 Eisenman, R.N. Cell 99:447-450 (1999)]. 

Histone deacetylases are recruited to specific promoters by mammalian transcriptions 
factors such as Matrix-associated Deacetylase (Mad) [Sommer, A. et al., Curr. Biol. 7 :357-365 
(1997)], YY1 [Yang, W.M. Proc. Natl. Acad. Sci. USA 93:12845-12850 (1996)], hormone- 
dependent nuclear receptor [Nagy, L.H. .et al., Cell 89:373-380 (1997)], MeCP2 [Jones, PX. et al., 

25 Nat. Genet. 19:187-191 (1998)], CBF [Kao, H.Y.P. et al., Genes Dev. 12:2269-2277 (1998)], 
Retinoblastoma protein (Rb) [Brehm, A. et al., Nature 391:597-601 (1998)], groucho [Chen, G. et 
al., Genes Dev. 13:2218-2230 (1999)] B-lymphocyte-induced maturation protein [Yu, J. et al., 
Mol. Cell. Biol. 20:2592-2603 (2000)] and related pocket proteins [Ferreira et al., Proc. Natl. 
Acad. Sci. USA 95:10493-10498 (1998)] for repression. The recruitment of human histone 

30 deacetylases by PZLF (promyelocytic leukaemia zinc finger), PML (promyelocyte leukaemia), 
and ETO fusion proteins can interfere with differentiation of hematopoietic precursor cells in acute 
promyelocytic leukemia [Lin, R.J. et al., Nature 31 1:811-815 (1996); David, G.L. et al., Oncogene 
16:25492556 (1998); Grignani, F.S. et al., Nature 391:815-818 (1998); Guidez et al., Blood 
91:2634-2642(1998)]. 

35 Several drugs have been identified as acting upon histone acetylation. Some examples are: 

trichostatin A (TSA), apicidin (antiprotozoal agent), superoylanilide hydroxamic acid (SAHA), 
cyclic hydroxamic acid-containing peptide (CHAP) 1, FR901228 (a potent antitumor), CBHA (m- 
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carboxycinnamic acid bis-hydroxamide), trapoxin, MS-275 (antitumor), pyroxamide (suberoyl-3- 
aminopyridineamide hydroxamic) acid and phenyl butyrate. Such drugs cause major alterations in 
cellular activity, including the induction of cellular differentiation and apoptosis [Medina, V. et al., 
Cancer Res. 57:3697-3707 (1997); Richon, V.M. et al., Proc. Natl. Acad. Sci. U.S.A. 95:3003- 
5 3007 (1998); Sambucetti, L.C. et al., J. Biol. Chem. 274:34940-34947 (1999); Buter, L.M. et al., 
Clin. Cancer Res. 7:962-970 (2001); Coffey, D.C. et al., Cancer Res. 61:3591-3594 (2001); 
Colletti, S.L. et al., Bioorg. Med. Chem. Lett. 1 1:107-1 1 1 (2001); Furumai, R. et al., Proc. Natl. 
Acad. Sci. USA 98:87-92 . (2001); Lee, B.I. et al., Cancer Res. 63:931-934 (2001)]. 

The protein of SEQ ID NO:72 is a novel splice and polymorphism variant of histone 

10 deacetylase and, as such, plays a role in transcription, chromosome stability, cell cycle progression, 
. gene silencing, lymphocyte and muscle differentiation, aging, regulation of neuronal phenotype, 
DNA replication and the response to DNA damage. Particularly, the protein of the invention may 
deacetylate substrates, preferably acetylated histones, either directly or indirectly as enzymes 
cofactors. Preferred polypeptides of the invention are polypeptides comprising the amino acids of 

15 SEQ ID NO:72 from positions 29 to 252. Also preferred are fragments of SEQ ID NO:72 having a 
biological activity as described therein and the polynucleotides encoding the fragments. The 
deacetylation activity of the protein of the invention or fragment thereof may be assayed using any 
of a number of methods known to those skilled in the art. 

The invention relates to methods and compositions using the protein of SEQ ID NO: 72 or 

20 fragment thereof to inhibit or modulate cellular transcriptional activity, thereby modulating cellular 
differentiation. Specifically, as histone deacetylases play a role in inhibiting transcription 
associated with differentiation, then an increase in the activity or expression of the protein can be 
used to inhibit differentiation. The ability to inhibit differentiation has a number of uses, for 
example during the cultivation of undifferentiated pluripotent cells to maintain the cultured cells in 

25 an undifferentiated state until the need for a given cell type arises (in cases of grafts for instance). 
For example, the histone deacetylase of the invention may be used to arrest a population of non- 
neoplastic cells grown in vitro in the Gl or G2 phase of the cell cycle. Such synchronization 
allows, for example, the identification of gene and/or gene products expressed during the Gl or G2 
phase of the cell cycle. Such a synchronization of cultured cells may also be useful, for testing the 

30 efficacy of a new transfection protocol, where transfection efficiency varies arid is dependent upon 
the particular cell cycle phase of the cell to be transfected. Use of the histone deacetylase of the 
invention allows the synchronization of a population of cells, thereby adding detection of enhanced 
transfection efficiency. The level of the protein activity or expression can be increased in any of a 
number of ways, including by introducing a polynucleotide encoding the protein into cells, by 

35 administering the protein itself to cells, or by administering to cells a compound that increases 
protein activity or expression. Alternatively, the expression or activation of the protein of the 
invention can be inhibited in any of a large number of ways, including using antisense 
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oligonucleotides, antibodies, dominant negative forms of the protein, and using heterologous 
compounds that decrease the expression or activation of the protein. Such compounds can be 
readily identified, e.g. by screening candidate compounds and detecting the level of expression or 
activity of the protein using any standard assay. The ability to promote differentiation has many 
5 uses, including in the treatment or prevention of cancer, as cancer cells are often in a relatively 
undifferentiated state, and cellular differentiation typically accompanies by growth arrest. 

In another embodiment, eukaryotic cells are genetically engineered in order to express the 
protein of the invention or fragment thereof under specific conditions in order to prevent and/or 
treat disorders characterized by abnormal cell proliferation and/or programmed cell death, 

10 including but not limited to cancer, immune deficiency syndromes (including AIDS), type I 
diabetes, pathogenic infections, cardiovascular and neurological injury, alopecia, aging, 
degenerative diseases such as Alzheimer's Disease, Parkinson's Disease, Huntington's disease, 
dystonia, Leber's hereditary optic neuropathy, schizophrenia, and myodegenerative disorders such 
as "mitochondrial encephalopathy, lactic acidosis, and stroke" (MELAS), and "myoclonic epilepsy 

1 5 ragged red fiber syndrome" (MERRF). For example, a vector capable of expressing the protein of 
SEQ ID NO: 72, or biologically active fragments thereof, can be administered to a subject to treat 
or prevent disorders including, but not limited to, those described above. Alternatively, the vector 
can encode a variant, or biologically active fragment of the variant protein. Multiple vectors 
encoding any combination of SEQ ID NO: 72, variants, and/or biologically active fragments of 

20 SEQ ID NO: 72 and/or variants can be administered to a subject. 

The invention relates to methods and compositions using the protein of the invention or 
fragment thereof to deacetylate substrates, alone or in combination with other substances, for 
example, but not limited to silence specific target genes. Acetylated substrates used in such 
methods are preferably acetylated histones and acetyltransferases. For example, the protein of the 

25 invention or fragment thereof is added to a sample containing a substrate in conditions allowing 
deacetylation, and allowed to catalyze the deacetylation of the substrate. In a preferred 
embodiment, the deacetylation is carried out using a standard assay such as those described in 
Landry and collaborators [Landry et al., Proc. Natl. Acad. Sci. 97:5807-581 1 (2000), the disclosure 
of which is incorporated by reference in its entirety]. Deacetylated histones obtained by this 
, 30 method may be mixed with purified naked DNA (plasmid preparations for example) in order to 
reconstitute chromatine-like structures in vitro. Such structures are of great interest in the study of 
enzymatic factors involved in transcription and replication. Natural transcription factors are unable 
to enter the condensed chromatin, and the gene function is effectively switched-off. Also, the 
chromatin condensation constitutes a valuable parameter in the assessment of male fertility, 

35 completely independent of conventional sperm parameters [Hammadeh, ME., et al., Arch 
Androl;46(2):99-104 (2001)]. 

Another embodiment of the present invention relates to composition and methods of using 
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the protein of the invention or fragment thereof to screen for inhibitors and activators of 
deacetylase activity. Such deacetylase inhibitors are of great potential as new drugs due to their 
ability to influence transcriptional regulation and to induce apoptosis or differentiation in cancer 
cells [Marks PA. et al., Clin. Cancer Res. 7:759-760 (2001)], and also as antiproliferative reagents 
5 involved in antiprotozoal, antifungal, phytotoxic and antiviral applications [Meinke, P.T. & 
Liberator, P., Curr. Med. Chem. 8:21 1-235 (2001)]. In one such embodiment, the protein of the 
invention is contacted in vitro with a fluorescently labeled acetylated substrate as well as a test 
agent, and the activity of the protein is detected, wherein a difference in the activity of the protein 
in the presence of the test agent in comparison to the activity in the absence of the test agent 

10 indicates that the test agent is a modulator of the protein. Suitable substrates include, e.g., 

aminocoumarin derivative of an acetylated lysine, which can be quantitated using a reverse-phase 
HPLC-system with a fluorescence detector [see, e.g., Hoffmann et al., Nucl. Acids Res. 27:2057- 
2058 (1999); Hoffinann et al., Pharmazie 55:601-606 (2000); the disclosures of each of which are 
incorporated herein in their entireties]. 

15 In another preferred embodiment, the polynucleotides of SEQ ID:71, polypeptides of SEQ 

ID:72 or antibodies to the polypeptide of the present invention may also be used in screening 
methods for detecting an abnormally decreased or increased level of polypeptides or mRNA, as 
well as to detect the effect of added compounds on the production of the present mRNA and 
. polypeptide in cells. Abnormal activity of our protein is associated with accelerated aging 

20 syndromes such as Cochayne's syndrome, Ataxia telangiectasia and Werner's syndrome as well as 
age-associated diseases as well as "early onset" forms of diseases associated with old age such as 
dementia and Parkinson's disease. Decreased or increased expression can be measured, for 
example, at the RNA level using any of the methods known in the art for the quantification of 
polynucleotides, such as nucleic acid amplification methods including PCR and RT-PCR, as well 

25 as RNAse protection, Northern blotting and other hybridization methods. Expression can also be 
detected using assays to determine levels of the present protein, such as ELISA assays.. These 
methods can also be used to discover agents which inhibit or enhance the production of 
polypeptide in cells or tissues. Examples of potential polypeptide inhibitors include antibodies, 
oligonucleotides, heterologous proteins, or small molecule inhibitors of the present protein. 

30 Another embodiment of the invention relates to methods of preparing antibodies that 

selectively bind to the protein of the invention or fragment thereof. Such antibodies may be used, 
for example, in co-immunoprecipitation procedures that enrich for chromatin fragments containing 
binding sites for the protein of the invention. This method may identify genes or regions of the 
human genome silenced by the deacetylase activity of the protein of the invention and also proteins 

35 which interact with the compacted form of the chromatin like RCC1 (regulator of chromosome 
condensation) [Renault, L. et al., Cell, 105:245-255 (2001)]. For example, in one method, 
antibodies that selectively bind to HDAC are coupled to protein A or protein G sepharose beads 
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and added to samples containing fragments of native chromatin under conditions amenable to 
immunoprecipitation, and the DNA fragments co-precipitated with HDAC are extracted and 
subcloned. These DNA fragments can then be either sequenced and/or used as probes to screen 
genomic libraries [Gould et al., Nature 348:308-3 12 (1990), the disclosure of which is incorporated 
5 herein by reference in its entirety]. 

In another embodiment, the invention relates to methods and compositions using the 
protein of the invention or fragment thereof as a marker protein to selectively identify tissues, such 
as salivary gland or placenta, or to distinguish between two or more possible sources of a tissue 
sample on the basis of the level of the protein of SEQ ID NO:72 in the sample. For example, the 

1 0 protein of SEQ ID NO:72 or fragments thereof may be used to generate antibodies using any 

techniques known to those skilled in the art, and the antibodies may then be used to identify tissues 
of unknown origin, for example, forensic samples, differentiated tumor tissue that has metastasized 
to foreign bodily sites, or to differentiate different tissue types in a tissue cross-section using 
immunochemistry. Typically, in such methods a tissue sample is contacted with the antibody, 

15 which may be detectably labeled, under conditions which facilitate antibody binding. In one 
embodiment, the level of antibody binding to the test sample is measured and compared to the 
level of binding expected from control cells from salivary gland and placenta, or tissues other than 
salivary gland and placenta, to determine whether the test sample is from salivary gland and 
placenta. Such methods may also be performed in conjunction with other, independant methods 

20 for determining cellular identity. Similar methods can be used to specifically detect cells 

expressing the protein, as well as to specifically isolate cells expressing the protein or to isolate the 
protein itself. For example, an antibody against the protein of SEQ ID NO:72 or a fragment 
thereof may be fixed to a solid support, such as a chromatography matrix. A preparation 
containing cells expressing the protein of SEQ ID NO:72 is placed in contact with the antibody 

25 under conditions which facilitate binding to the antibody. The support is washed and then the 
protein is released from the support by contacting the support with agents which cause the protein 
to dissociate from the antibody. 

A preferred embodiment of the invention relates to compositions or methods using the 
protein of SEQ ID NO:72 or fragment thereof to diagnose, treat and/or prevent disorders caused by 

30 the expression of genes whose transcription is regulated by the extent of local chromatin 

condensation. The number of pathologies and conditions that could be treated by the protein of the 
invention is potentially huge and unlimited. Favored disorders linked to dysregulation of gene 
transcription such as cancer and other disorders relating to abnormal cellular differentiation, 
proliferation, or degeneration, including leukemia, lymphomas, prostate hypertrophy, kidney 

35 diseases, kidney failures, viral infection especially HIV and viral hepatitis (i.e. expression of viral 
proteins), metabolic diseases such as obesity and a number of inflammatory diseases, for example 
due to interleukin over-expression. For diagnostic purposes, the expression of the protein of the 
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invention can be investigated using any method, for example Northern blotting, RT-PCR or 
immunoblotting methods, and compared to the expression in control individuals. For prevention 
and/or treatment purposes, the expression of the protein of the invention may be enhanced, 
inhibited, or otherwise altered in a patient using any of a number of methods, including gene 
5 therapy methods, or by administering a compound that enhances or inhibits the expression or 
activity of the protein. 

In one embodiment, the present invention provides a method for inhibiting the proliferation 
of a cell, the method comprising introducing into the cell the protein of the invention, linked to a 
heterologous protein domain that specifically targets the present protein to a cell-proliferation- 

10 regulating gene, wherein the targeting of the present protein to the gene results in local chromatin 
condensation and an inhibition in the expression of the gene. Cell-fusion proteins containing both 
the deacetylase activity and the specific DNA binding domain are obtained by methods of 
molecular biology well known to those skilled in the art. In one embodiment, such fusion proteins 
are introduced into the cell by transfecting the cell with a polynucleotide encoding the fusion 

15 protein, wherein the fusion protein is expressed in the cell. Such polynucleotides, e.g. in the form 
of expression vectors, which can thus be used, e.g., for gene therapy to treat or prevent cancer, 
metabolic disorders, aging and any disorder where a gene is over-expressed in association with 
local chromatin decondensation. Such recombinant cDNA may be introduced, for example, using 
in any vector, viral or non-viral, and viral vectors can be but not limited to retroviral, adenoviral, 

20 and adeno-associated vectors, which have been used in cancer therapy (Alemany et al., Nat. 
Biotechnol. 18:723-727 (2000)). Another approach is to administer a therapeutic amount of a 
polypeptide of SEQ DD:72, preferably in combination with a suitable pharmaceutical carrier. Such 
carriers include, but are not limited to, saline, buffered saline, dextrose, water, glycerol, ethanol 
and combinations thereof. 

25 In another embodiment, an array of oligonucleotides probes comprising the nucleotide 

sequence of SEQ ID NO:71 or fragments thereof can be constructed to conduct efficient screening 
of e.g., genetic mutations. The microarray can be used to monitor the expression level of large 
numbers of genes simultaneously and to identify genetic variants, mutations, and polymorphisms. 
This information may be used to determine gene function, to understand the genetic basis of a 

30 disorder, to diagnose a disorder, and to develop and monitor the activities of therapeutic agents 
(see for example: Chee, M. et al., Science, 274:610-614 (1996)). It has been shown that multiple 
classical features of cancer cells can be manifested by improper histone deacetylation [for review 
see Wade, P.A. Hum Mol Genet; 10(7):693-698 (2001)]. 

Another related embodiment relates to the use of SEQ ID NO:72, its complement, or any 

35 part thereof to develop antagonists of the protein of the invention and of the HDAC complex. 
Antagonists or inhibitors of histone deacetylase may indeed be used to suppress gene silencing. 
Such antagonists and/or inhibitors may be antibodies specific for the protein of the invention that 
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can be used directly as an antagonist, or indirectly as a targeting or deliveiy mechanism for 
bringing a pharmaceutical agent to cells or tissue which express the protein of the invention. Other 
methods to inhibit the expression of the protein of the invention include antisense and triple helix 
stategies as described herein. Other antagonists or inhibitors of the protein of the invention may be 

5 produced using methods which are generally known in the art, including the screening of libraries 
of pharmaceutical agents to identify those which specifically bind the protein of the invention. The 
protein of the invention, or fragment thereof, preferably its functional or immunogenic fragments, 
or oligopeptides related thereto, can be used for screening libraries of compounds in any of a 
variety of drug screening techniques. The fragment employed in such screening may be free in 

10 solution, affixed to a solid support, borne on a cell surface, or located intracellularly. The ■ 
formation of binding complexes, between the protein of the invention, or fragment thereof, or 
derivative thereof, and the agent being tested, may be measured. Another technique for drug 
screening which may be used provides for high throughput screening of compounds having 
suitable binding affinity to the protein of the invention as described in published PCT application 

15 WO84/03564. Abnormal gene silencing causes conditions like, but not limited to, accelerated 
aging syndromes such as Cochayne's syndrome, Ataxia telangiectasia and Werner's syndrome as 
well as age-associated diseases as well as "early onset" forms of diseases associated with old age 
such as dementia and Parkinson's disease. 

Protein of SEQ ID:74 (Internal designation Clone 500716683 J204-24-2-0-D12-F) 

20 The protein of SEQ ID NO:74, herein referred as short Paraplegin , comprising the amino 

acid sequence: 

MAVLLLLLRALRRGPGPGPRPLWGPGPAW 
LQSLQLRLLTPTFEGINGLLLKQ 

APEEDEGEFI, is encoded by the cDNA of clone 500716683_204-24-2-0-D12-F (SEQ ID NO:73). 

25 Accordingly, it will be appreciated that all characteristics and uses of the polypeptide of SEQ ID 
NO:74 described throughout the present application also pertain to the polypeptide encoded by a 
nucleic acid included in clone 500716683_204-24-2-0-D12-F. In addition, it will be appreciated 
that all characteristics and uses of the nucleic acid of SEQ ID NO:73 described throughout the 
present application also pertain to the nucleic acid included in clone 500716683_204-24-2-0-D12- 

30 F. A preferred embodiment of the invention is directed toward the compositions of SEQ ID 

NO:73, SEQ ID NO:74, and Clone 500716683_204-24-2-0-D12-F. Also preferred are polypeptide 
fragments having a biological activity as described herein and the polynucleotides encoding the 
fragments. 

The protein of SEQ ID NO:74 is encoded by a nucleic acid of 879 nucleotides with an 
35 ORF between nt 9 to 395 yielding a 129 amino acid protein. The protein is a variant of the 

sequence for human protease and associated protein-15 (PPRG-15) (described in PCT publication 
WO200009709-A2, the disclosure of which is incorporated herein by reference in its entirety) and 
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of the sequence of the protein associated to hereditary spastic paraplegia (described in PCT 
publication W09958556-A2, the disclosure of which is incorporated herein by reference in its 
entirety). It has a signal peptide spanning 17 amino acid residues at its N-terminal. The protein of 
SEQ ID NO:74 is localized in the brain and has a mitochondrial localizing signal peptide. 
5 Moreover, the protein of SEQ ID NO:74 exhibits high homology to the N-terminal of 

hereditary spastic paraplegia protein sequence (described in PCT publication W09958556-A2). 
Hereditary Spastic paraplegia (HSP) is characterized by progressive weakness and spasticity of the 
lower limbs due to degeneration of corticospinal axons. (Harding, A.E., J. Med. Genet. 18: 436- 
441(1981); Fink, J.K., et al., Am. J. Hum. Genet. 56:188-192 (1997); Reid, E., J. Med. Genet. 

10 34:499-503, (1997)). This is a genetically heterogeneous group of neurodegenerative disorders 
affecting approximately 1 in 10,000 individuals (Filla (1992); Polo et al. (1993)). Patients with 
HSP typically show leg stiffness and gait disturbance, decreased perception of sharp stimulation, 
and diminished vibratory sense in the distal lower limbs. Both the age of onset and severity of the 
symptoms are highly variable even among individuals from the same family (Harding, A.E., J. 

15 Med. Genet. 1 8: 436-441(1981); Durr et al., 1994). Currently, no specific treatment is available to 
prevent, cure, or delay progression of symptoms of HSP. 

In addition to the above-described clinical spectrum, which is typical of the "pure" form of 
HSP, several patients have been shown to have "complicated" forma of HSP characterized by the 
presence of additional neurological and non-neurological symptoms such as metal retardation, 

.20 peripheral neuropathy, amyotrophy, ataxia, retinitis pigmentosa, optic atrophy, deafness, and 
ichtyosis (Bonneau, D., et al., J. Med. Genet. 30:381-384 (1993); Gigli, G.L., et al, Am. J. Med. 
Genet. 45:711-716 (1993); Lizcano-Gil, L.A. et al., Am J; Med. Genet. 68:1-6 (1997); Webb, S., 
et al., Epilepsia 38:495-499 (1997)). Albeit some of these forms have been found to segregate in 
families, it is still unclear whether complicated forms of HSP represent distinct genetic entities or 

25 variant presentations of pure HSP. However, even in pure forma of HSP (i.e., with clinical 

features limited to the lower segments), a broader subclinical involvement of the nervous system 
has been demonstrated [Tedeschi, G. et al., J. Neurol. Sci. 103:55-60 (1991); Durr, A., et al., 
Neurology 44:1274-1277 (1994)]. 

Autosomal dominant, autosomal recessive, andX-linked forms of HSP have been 

30 described, indicating genetic heterogeneity [Harding, A.E., J. Med. Genet. 18: 436-441(1981); 
Fink, JJC., et al, Am. J. Hum. Genet. 56:188-192 (1997); Reid, E., J. Med. Genet. 34:499-503, 
(1997)]. Casari and collaborators have identified and characterized a gene associated to hereditary 
spastic paraplegia, located in the telomere region of chromosome 16q, and the protein deriving, 
therefrom, named paraplegin [Casari, G., et al. Cell 93:973,983 (1998)]. 

35 It is believed that the protein of SEQ ID NO:74, or fragment thereof is a mitochondrial 

protein associated to hereditary spastic paraplegia. The protein of the invention or fragment 
thereof may play a role in the mitochondrial degradation machinery. Preferred polypeptides of the 
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invention are polypeptides comprising the amino acids of SEQ ID NO:74 from positions 1 to 125. 
Other preferred polypeptides of the invention are fragments of SEQ ID NO:74 having any of the 
biological activities described herein. 

Another embodiment of the invention relates to compositions and methods using the 
5 protein of the invention or fragment thereof to label mitochondria in order to visualize any change 
in number, topology or morphology of this organelle, for example in association with a 
mitochondria-related human disorder, such as hereditary spastic paraplegia [Casari, G., et aL Cell 
93:973,983 (1998)], neuroleptic malignant syndrome (NMS) [Kubo et al., Forensic Sci. Int. 
1 15:155-158 (2001)], the Rett syndrome [Armstrong, Brain Dev. 14 Suppl:S89«98 (1992)], Alpers 

10 disease [Chow and Thorburn, Hum. Reprod. 15 Suppl 2:68-78 (2000)] or mitochondrial 

encephalomyopathies [Handran et al., Neurobiol. Dis. 3:287-298 (1997)]. Casari and collaborators 
have shown that paraplegin protein localizes to mitochondria by immunofluorescence studies 
[Casari, G., et al. Cell 93:973,983 (1998)]. Paraplegin protein exhibits a helical wheel pattern (an 
amphiphilic structure composed of basic residues, mainly arginine, on one side and apolar residues 

15 on the opposite side) of the N-terminal which is highly homolog to the protein of the invention; 
moreover the high ratio of arginine to lysin among the first 41 amino acids indicates the presence 
of typical mitochondrial leader sequences [Casari, G., et al. Cell 93:973,983 (1998)]. For example, 
the protein may be rendered easily detectable by inserting the cDNA encoding the protein of the 
invention into a eukaryotic expression vector in frame with a sequence encoding a tag sequence. 

20 Eukaryotic cells expressing the tagged protein of the invention may also be used for the in vitro 
screening of drugs or genes capable of treating any mitochondria-related disease or conditions. 
Another example, the protein of the invention or fragment thereof may be used to generate specific 
antibodies which would in turn allow the visualization of mitochondrial structures by methods 
well-known to those of skill in the art. 

25 In another embodiment, the protein of the invention may be used to target heterologous 

compounds (polypeptides or polynucleotides) to the brain and/or the mitochondria. For instance, a 
chimeric protein composed of the protein of the invention recombinantly or chemically fused to a 
protein or polynucleotide of therapeutic interest would allow the delivery of the therapeutic 
protein/polynucleotide specifically to the above-mentioned cellular/tissue targets (mitochondria, 

30 brain). Preferred fragments are the putative peptide signal, and/or any other fragments of the 
protein of the invention that may contain targeting signals for mitochondria). Such heterologous 
compounds may be used to modulate mitochondrial activities, such as to induce and/or prevent 
mitochondrial-induced apoptosis or necrosis. For example, these heterologous compounds may be 
used in the treatment and/or the prevention of disorders due to mitochondrial dysfunction, 

35 including, but not limited to, hereditary spastic paraplegia. In addition, heterologous 

polynucleotides may be used to deliver nucleic acids for mitochondrial gene therapy, i.e. to replace 
a defective mitochondrial gene and/or to inhibit the deleterious expression of a mitochondrial gene. 
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An antagonist of the protein of SEQ ID NO:74 may be produced using methods which are 
generally know in the art. In one aspect, the protein of the invention or fragment may be used to 
synthesize specific antibodies using any techniques known to those skilled in the art including 
those described therein. In particular, purified short paraplegin may be used to produce antibodies 
5 or to screen libraries of pharmaceutical agents to identify those which specifically bind short 
paraplegin. 

In a further embodiment, a pharmaceutical composition comprising a substantially purified 
protein of SEQ ID NO:74 in conjunction with a suitable pharmaceutical carrier may be 
administered to a subject to treat or prevent a disorder associated with change of expression or 

10 activity of short paraplegin including, but not limited to, those described above. The antibody 
which specifically binds short paraplegin may be used directly as an antagonist or indirectly as a 
targeting or delivery mechanism for bringing a pharmaceutical agent to cells or tissues which 
express short paraplegin. 

In another embodiment, antibodies which specifically bind the protein of SEQ ID NO: 74 

1 5 may be used for the diagnosis of disorders characterized by expression of short paraplegin. 

Truncated forms of paraplegin are involved in hereditary spastic paraplegia (Casari, G., et al. Cell 
93:973,983 (1998)). Diagnostics assays for short paraplegin include methods which utilize the 
antibody and a label to detect short paraplegin in human body fluids or in extract of cells or tissues. 
A variety of protocols for measuring short paraplegin, including ELISA's, RIAs, and FACs, are 

20 known in the art and provide a basis for diagnosing the presence of short paraplegin expression. 

In another embodiment, the polynucleotide of SEQ ID NO:73 or a fragment may be used 
for diagnostic purposes in assays that detect the presence of associated disorders, for example but 
not limited to,, hereditary spastic paraplegia. The polynucleotides which may be used include 
oligonucleotide sequences, complementary RNA and DNA molecules, PNAs. The polynucleotides 

25 may be used to detect and quantitate gene expression in biopsied tissues in which expression of 
short paraplegin maybe correlate with disease. The nucleotide sequences encoding short 
paraplegin maybe labeled by standard methods and added to a fluid or tissue sample from a patient 
under conditions suitable for the formation of hybridization complexes. After a suitable incubation 
period, the sample is washed and the signal is quantitated and compared with a standard value. If 

30 the amount of signal in the patient sample is significantly increased in comparison to a control 
sample then the presence of increased levels of nucleotide sequences encoding short paraplegin in 
the sample indicates the presence of associated disorder, particularly but not limited to, hereditary 
spastic paraplegia. Such assays may also be used to evaluate the efficacy of a particular 
therapeutic treatment regimen in animal studies, in clinical trials, or to monitor the treatment of an 

35 individual patient. 

Once the presence of a disorder is established and a treatment protocol is initiated, 
hybridization assays may be repeated on a regular basis to determine if the level of expression in 
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the patient begins to approximate that which is observed in the normal subject. The results 
obtained from successive assays may be used to show the efficacy of treatment over a period of 
time. 

In another embodiment, an array of oligonucleotides probes comprising the nucleotide 
5 sequence of SEQ ID NO:73 or fragments thereof can be constructed to conduct efficient screening 
of e.g., genetic mutations. The microarray can be used to monitor the expression level of large 
numbers of genes simultaneously and to identify genetic variants, mutations, and polymorphisms. 
This information may be used to determine gene function, to understand the genetic basis of a 
disorder, to diagnose a disorder, and to develop and monitor the activities of therapeutic agents 
10 [see for example: Chee, M. et al., Science, 274:610-614 (1996)]. For example, it has been shown 
that genetic variants, mutations, and polymorphisms are related to hereditary spastic paraplegia 
[for review see Casari, G., and Rugarli, E. Curr. Opin. Genetics and Development, 1 1 :336-342 
(2001)]. 

In another preferred embodiment, the protein of the invention or fragment thereof can be 
, 15 used in an enzyme/prodrug strategy to treat a number of pathologies, especially those treated with 
. drugs associated with severe side effects, including, but not limited to, autoimmune diseases and 
chronic inflammatory diseases such as rheumatoid arthritis, and cancer chemotherapy. These side 
effects can be mainly explained by the fact that the in vivo selectivity of the drugs used is too low 
(for example, the inadequate selectivity between tumor and normal cells of most anticancer drugs 

20 is well known and their toxicity to normal tissues is dose limiting). In the first phase of one 

example of such a protocol, a conjugate of the protein of the invention or fragment thereof and an 
antibody to a tissue specific antigen (for example, tumor specific antigens in the case of cancer 
chemotherapy) is administered. After a delay to allow residual enzyme conjugate to be cleared 
; from the blood, a relatively non-toxic compound is administered to the patient. This non-toxic 

25 compound is a substrate of the protein of the invention, and is converted by the protein into a 
substantially more toxic compound. Thus, because of the previous, targeted administration of the 
protein of the invention, when the non-toxic compound is administered, the toxic compound is 
only produced in the vicinity of the cells targeted by the fusion protein. This two-phase approach 
has been termed antibody-directed enzyme-prodrug therapy (ADEPT), this approach is reviewed 

30 by Melton et al. {Melton R. et al., J. Natl. Cancer Inst, 88, pl53-165 (1996)]. Alternatively the 
first phase can be replaced by a gene therapy approach resulting in the de novo synthesis of the 
protein of the invention or fragment thereof by cells from the targeted tissue, this has been termed 
gene-dependent enzyme/prodrug therapy (GDEPT). Another advantage of these 2 approaches 
(ADEPT and GDEPT) is that a single enzyme molecule is capable of activating many prodrug 

35 molecules. 

Protein of SEQ BD:76 (Internal designation Clone 500760207J05-58-4-0-H6-F) 

The protein of SEQ ID NO:76, herein referred as Ketothiolase (KT), comprising the amino 
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acid sequence: 
MMGVFWAAKRTPrc^ 

SSDAIYLARHVGLRVGIPKETPALTnSORLCGSGFQSI^ 
APYCVRNVRFGTKLGSDDaEDSLWVSLTDQHVQLPMA^ 
5 LQSQQRWKAAM)AGYFNDEMAPffiVKT^GKQTM 

DGTVTAGNASGVADGAGAVnASEDAVKKHNFTPLARIV GYFVSGCDPSMGIGPVPAISG 
ALKKAGLSLKDMDLVEVNEAFAPQYLAVERSLD^ 

AHLVHELRRRGGKYAVGSACIGGGQGIAVHQSTA, is encoded by the cDNA of clone 
500760207J205-58-4-0-H6-F (SEQ ID NO:75). Accordingly, it will be appreciated that all 

1 0 characteristics and uses of the polypeptide of SEQ ID NO:76 described throughout the present 
application also pertain to the polypeptide encoded by the human cDNA of clone 500760207_205- 
58-4-0-H6-F. In addition, it will be appreciated that all characteristics and uses of the nucleic acid 
of SEQ ID NO:75 described throughout the present application also pertain to the human cDNA of 
clone 500760207^205-5 8-4-0-H6-F. A preferred embodiment of the invention is directed toward 

15 the compositions of SEQ ID NO:75, SEQ ID NO:76, and Clone 500760207_205-58-4-0-H6-F. 
Also preferred are polypeptide fragments having a biological activity as described herein and the 
polynucleotides encoding the fragments. 

The protein of SEQ ID NO:76 encoded by the cDNA of SEQ ID NO:75 is a polymorphism 
variant of 3-Ketoacyl CoA Thiolase protein (GENPEPT accession number D16294). 

20 Furthermore, a BLAST search with the amino acid sequence of SEQ ID NO:76 indicates that the 
protein of the invention is homologous to 3-Ketoacyl CoA thiolase of rat (Swissprot accession 
number P13437) and Bacillus halodurans (Genbank accession number AP001514). 

The 394 amino acids protein of SEQ ID NO:76 displays 1 candidate membrane-spanning 
segment, from amino acids 373 to 393. Accordingly, some embodiments of the present invention 

25 relate to polypeptides comprising the transmembrane domain. Finally, the protein of the invention 
displays the 3 thiolase signatures (PS00098, PS00737, PS00099) spanning from positions 85 to 
103, positions 339 to 355, and positions 374 to 387, respectively. Accordingly, some embodiments 
of the present invention relate to polypeptides comprising the thiolase signature. 

Living organisms are exposed to a number of different fatty acids and their various 

30 derivatives arising either via endogenous synthesis or from exogenous sources. These hydrophobic 
compounds can play specific metabolic, structural or endocrinic functions in the organisms before 
their elimination, which can be metabolism to C0 2 or to more polar lipid metabolites allowing 
their excretion. Quantitatively, one of the major pathways metabolizing fatty acids is 0-oxidation, 
which is often described as a spiral of four reactions catalyzed by three enzymes. 

35 The three consecutive steps of mitochondrial P-oxidation of fatty acids, including the long- 

chain 3-hydroxyl-CoA dehydrogenase, are catalyzed by the trifunctional protein: 2-enoyl-CoA 
hydratase, 3-hydroxyacyl-CoA deshydrogenase and 3-ketoacyl-CoA thiolase. Deficiencies in 
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enzyme activities of the heterocomplex, which contains 4 alpha and 4 beta subunits, causes sudden 
unexplained infant death, a Reye-like syndrome, cardiomyopathy, or skeletal myopathy. 

Defects in the trifiinctional protein fall into two groups: patients with an isolated defect in 
3-hydroxyacyl-CoA dehydrogenase and those with a deficiency in all three activities and absence 
5 of immunoreactive protein (Tyni, J. et a]., Acta Paeditr. 88:237-245 (1999)). Patients in the second 
group have been found to have either deletions in the a-subunit cDNA encoding for 2-enoyl-CoA 
hydratase and 3 -hydroxy acyl-CoA deshydrogenase or point mutations in the P-subunit encoding 
for 3-ketoacyl-CoA thiolase fUshikubo, S. etal., Am. J; Hum. Genet 58: 979-988 (1996); Ori, K.R 
et al., Hum. Mol Genet. 6: 1215-1224 (1997)]. 

10 It is believed that the protein of SEQ ID NO: 76 or fragment thereof is an hydrolase, 

preferably acting on ester bonds, more preferably a thiolester hydrolase, even more preferably an 
ketoacyl-CoA thiolase which, as such, plays a role in fatty acid metabolism, in cellular vesicle 
transport and maintenance of the cytoarchitecture, in cellular proteolysis, endocytosis, signal 
transduction, lysosomal storage, cell proliferation and differentiation, immune and inflammatory 

15 response. The enzyme's substrates are compounds preferably containing an ester bond, preferably 
a thiol ester bond, more preferably an acyl thioester bond. Preferred polypeptides of the invention 
are polypeptides comprising the amino acids of SEQ ID NO: 76 from positions 85 to 103, positions 
339 to 355, and positions 374 to 387. Other preferred polypeptides of the invention are fragments 
of SEQ ID NO: 76 having any of the biological activities described herein. The hydrolytic activity 

20 of the protein of the invention or fragment thereof may be assayed using any of the assays known 
to those skilled in the art including those described in US patents 5,445,942. The ability to bind a 
cofactor may also be assayed using any techniques well known to those skilled in the art including, 
for example, the assay for binding NAD described in US patent 5,986,172. 

Another embodiment of the invention relates to compositions and methods using the 

25 protein of the invention or fragment thereof to label mitochondria, or more specifically the inner 
mitochondrial membrane, in order to visualize any change in number, topology or morphology of 
this organelle, for example in association with a mitochondria-related human disorder, such as 
neuroleptic malignant syndrome (NMS) (Kubo et al., Forensic Sci. Int. 115:155-158 (2001)), the 

■ Rett syndrome (Armstrong, Brain Dev. 14 Suppl:S89-98 (1992)), Alpers disease (Chow and 

30 Thorbum, Hum. Reprod. 15 Suppl 2:68-78 (2000)) or mitochondrial encephalomyopathies 
(Handran et al., Neurobiol. Dis. 3:287-298 (1997)). For example, the protein may be rendered 
easily detectable by inserting the cDNA encoding the protein of the invention into a eukaryotic 
expression vector in frame with a sequence encoding a tag sequence. Eukaryotic cells expressing 
the tagged protein of the invention may also be used for the in vitro screening of drugs or genes 

35 capable of treating any mitochondria-related disease or conditions. 

hi one embodiment, the invention relates to compositions and methods using the protein of 
SEQ ID NO: 76 or fragment thereof as a marker for tissue types (especially placenta), or to 
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distinguish between two or more possible sources of a tissue sample on the basis of the level of the 
protein of SEQ ID NO:76 in the sample. For example, the protein of SEQ ID NO:76 or fragments 
thereof may be used to generate antibodies using any techniques known to those skilled in the art, 
and the antibodies may then be used to identify tissues of unknown origin, for example, forensic 
5 samples, differentiated tumor tissue that has metastasized to foreign bodily sites, or to differentiate 
different tissue types in a tissue cross-section using immunochemistry. Typically, in such methods 
a tissue sample is contacted with the antibody, which may be detectably labeled, under conditions 
which facilitate antibody binding. In one embodiment, the level of antibody binding to the test 
sample is measured and compared to the level of binding expected from control cells from 

10 placenta, or tissues other than placenta to determine whether the test sample is from placenta. 
Such methods may also be performed in conjunction with other, independant methods for 
determining cellular identity. Similar methods can be used to specifically detect cells expressing 
the protein, as well as to specifically isolate cells expressing the protein or to isolate the protein 
itself. For example, an antibody against the protein of SEQ ID NO:76 or a fragment thereof may 

15 be fixed to a solid support, such as a chromatography matrix. A preparation containing cells 
expressing the protein of SEQ ID NO:76 is placed in contact with the antibody under conditions 
which facilitate binding to the antibody. The support is washed and then the protein is released 
from the support by contacting the support with agents which cause the protein to dissociate from 
the antibody. 

20 In another embodiment, the protein of the invention may be used to target heterologous 

compounds (polypeptides or polynucleotides) to the placenta and/or the cell mitochondria. For 
instance, a chimeric protein composed of the protein of the invention recombinantly or chemically 
fused to a protein or polynucleotide of therapeutic interest would allow the delivery of the 
therapeutic protein/polynucleotide specifically to the above-mentioned cellular/tissue targets 

25 .. (mitochondria, placenta). 

Another embodiment of the invention relates to composition and methods using 
polynucleotide sequences encoding the protein of the invention or fragment thereof to establish 
transgenic model animals (D. melanogaster, M. musculus), by any method familiar to those skilled 
in the art. By modulating in vivo the expression of the transgene with drugs or modifier genes 

30 (activator or suppressor genes), animal models can be developed that mimic human mitochondria- 
associated disorders such as myopathies or obesity. These animal models would thus allow the 
identification of potential therapeutic agents for treatment of the disorders. In addition, 
recombinant cell lines derived from these transgenic animals may be used for similar approaches 
ex vivo. 

35 In another embodiment, the invention relates to compositions and methods using the 

proteins of the invention or fragment thereof such as ligands for substrates of interest. In a 
preferred embodiment, the proteins of the invention or fragment thereof may be used to identify 
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and/or quantify substrates using any techniques known to those skilled in the art. To find 
substrates, the proteins of the invention, or fragment thereof, or derivative thereof, may be used for 
screening libraries of compounds in any of a variety of drug screening techniques. The fragment 
employed in such screening may be free in solution, affixed to a solid support, borne on a cell 
5 surface, or located intracellularly. The formation of binding complexes, between the proteins of 
the invention, or fragment thereof, or derivative thereof, and the agent being tested, may be 
measured. Antagonists or inhibitors of the proteins of the invention may be produced using 
methods which are generally known in the art, including the screening of libraries of 
pharmaceutical agents to identify those which specifically bind the protein of the invention. 

10 Another technique for drug screening which may be used provides for high throughput screening 
of compounds having suitable binding affinity to the proteins of the invention as described in 
published PCT application WO84/03564. 

In another embodiment, the invention relates to methods and compositions for detecting 
and quantifying the level of the protein of the invention present in a particular biological sample. 

15 These methods are useful for the diagnosis or prognosis of diseases associated with an altered 
levels of the protein of the invention like, but not limited to, deficiency of the hydrogenase activity 
(LCHAD deficiency). Diagnostic assays to detect the protein of the invention may comprise a 
biopsy, in situ assay of cells from organ or tissue sections, or an aspirate of cells from a tumor or 
normal tissue. In addition, assays may be conducted upon cellular extracts from organs, tissues, 

20 cells, urine, or serum or blood or any other body fluid or extract. 

Assays for the quantification of the KT of SEQ ID NO:76 may be performed according to 
methods well known in the art Typically, these assays comprise contacting the sample with a 
ligand of the protein of the invention or an antibody (polyclonal or monoclonal) which recognizes 
the protein of the invention or a fragment thereof, and detecting the complex formed between the 

25 protein of the invention present in the sample and the ligand or antibody. Fragments of the ligands 
and antibodies may also be used in the binding assays, provided these fragments are capable of 
specifically interacting with the KT of the subject invention. Further, the ligands and antibodies 
which bind to the KT of the invention may be labeled according to methods known in the art. 
Labels which are useful in the subject invention include, but are not limited to, enzymes labels, 

30 radioisotopic labels, paramagnetic labels, and chemiluminescent labels. Typical techniques are 
described by Kennedy, J. H., et al. (1976) Clin. Chim. Acta 70:1-31; and Schurs, A. H. et al. 
(1977) Clin. Chim. Acta 81: 1-40. 

Ia another ambodiment, the present invention includes the use of the protein of SEQ ID 
NO:76, or fragments having a desired biological activity to treat or ameliorate a condition in an 

35 individual For example, the condition may be deficiency of the hydrogenase activity (LCHAD 
deficiency), hypoglycemia, musculr hypotonia, hyperamonia, mild liver dysfunction, 3- . 
hydrixydicarboxylic aciduria, cardiomyopathy, retinal dystrophy, Bannayan-Riley-Ruvalcaba 
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syndrome, or an abnormality in any of the functions of the hydrogenase activity. In such 
embodiments, the protein of SEQ ID NO:76, or a fragment thereof, is administered to an individual 
in whom it is desired to increase or decrease any of the activities of the protein of SEQ ID NO:76. 
The protein of SEQ ID NO:76 or fragment thereof may be administered directly to the individual 
5 or, alternatively, a nucleic acid encoding the protein of SEQ ID NO:76 or a fragment thereof may 
be administered to the individual. Alternatively, an agent which increases the activity of the 
protein of SEQ ID NO:76 may be administered to the individual. Such agents may be identified by 
contacting the protein of SEQ ID NO:76 or a cell or preparation containing the protein of SEQ ID 
NO:76 with a test agent and assaying whether the test agent increases the activity of the protein. 

10 For example, the test agent may be a chemical compound or a polypeptide or peptide. 

Alternatively, the activity of the protein of SEQ ID NO:76 may be decreased by administering an 
agent which interferes with such activity to an individual. Agents which interfere with the activity 
of the protein of SEQ ID NO: 76 may be identified by contacting the protein of SEQ ID NO:76 or a 
cell or preparation containing the protein of SEQ ID NO:76 with a test agent and assaying whether 

15 the test agent decreases the activity of the protein. For example, the agent may be a chemical 
compound, a polypeptide or peptide, an antibody, or a nucleic acid such as an antisense nucleic 
acid or a triple helix-forming nucleic acid. 

In another embodiment, the invention also relates to the use of polynucleotides of SEQ ID 
NO:75 as diagnostic reagents. Detection of a mutated form of the gene characterized by the 

20 polynucleotide of SEQ ID NO:75 which is associated with a dysfunction will provide a diagnostic 
tool that can add to, or define, a diagnosis of a disease, or susceptibility to a disease. It has been 
shown previously that mutations in the beta subunit are responsible for trifunctional protein related 
diseases like those listed above [Ushikibo, S., et al, Am J. Hum. Genet. 58:979-988 (1996); Ori, 
K.F. et al., Hum. Mol Genet. 6:1215-1224 (1997)]. Individuals carrying mutations in the gene may 

25 be detected at the DNA level by a variety of techniques known by those skilled in the art. Nucleic 
acids for diagnosis may be obtained from a subject cells, such as from blood, urine, saliva, tissue 
biopsy or autopsy material. The genomic DNA may be used directly for detection or may be 
amplified enzymatically by using PCR or other amplification techniques prior to analysis. 

hi another embodiment, an array of oligonucleotides probes comprising the nucleotide 

30 sequence of SEQ ID NO: 75 or fragments thereof can be constructed to conduct efficient screening 
of e.g., genetic mutations. The microarray can be used to monitor the expression level of large 
numbers of genes simultaneously and to identify genetic variants, mutations, and polymorphisms. 
This information may be used to determine gene function, to understand the genetic basis of a 
disorder, to diagnose a disorder, and to develop and monitor the activities of therapeutic agents 

35 (see for example: Chee, M. et al., Science, 274:610-614 (1996)). 

Protein of SEQ ID NO:78 (Internal designation Clone 122421 JL05-076-4-0-H1-F) 

The cDNA of clone 122421__105-076-4-0-Hl-F (SEQ ID NO:77) encodes the protein of 
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SEQ ID NO:78, comprising the amino acid sequence: 
MAAAI^VLLGFALLGTHGASGA 

VVIJCEDALPGQKTEFK\TDSDDQWGEYSCVFLPEPMGTAMQLHGPPRV^ 
ETAMLVCKSESWPVTDWAWYKITDSEDKALMNGSE^ 
5 PGQYRCNGTSSKGSDQAHI^ 

DDAGSAPLKSSGQHQNDKGIONVRQRNSS. Accordingly, it will be appreciated that all 
characteristics and uses of the polypeptide of SEQ ID NO:78 described throughout the present 
application also pertain to the polypeptide encoded by the nucleic acids included in clone 
122421_105-076-4-0-Hl-F. In addition, it will be appreciated that all characteristics and uses of 

10 the nucleic acid of SEQ ID NO:77 described throughout the present application also pertain to the 
nucleic acids included in clone 122421_105-076-4-0-Hl-F. A preferred embodiment of the 
invention is directed toward the compositions of SEQ ID NO:77, SEQ ID NO:78, and Clone 
12242 l_105-076-4-0-Hl-F. Also preferred are polypeptide fragments having a biological activity 
as described herein and the polynucleotides encoding the fragments. 

1 5 The protein of SEQ ID NO:78 (BASE) is a novel polymorphic variant of human basigin. 

BASI2 displays a signal peptide (MAAALFVLLGFALLGTHG), and two immunoglobulin (Ig) 
domains 

(GSKILLTCSLNDSATEVTGHRWLKGGVVLKEDALPGQKTEF^ and 
GETAMLVCKSESWPVTDWAWYKITDSEDKALMNGSES 

20 DGQYRCNGTSS). Furthermore, BASK displays three N-glycosylation sites (NDSA, NGSE, and 
NGTS). The arginine at position 166 in basigin is changed to leucine in BASI2. Thus, the 
polymorphic, nonconservative change present in BASE is located in the second Ig domain, which 
is involved in protein-protein interactions. Such a polymorphic change located in the second Ig 
domain has never been previously reported. Thus, as a novel polymorphic variant of basigin, 

25 BASE displays similar biological activities as basigin, but displays enhanced kinetic parameters 
during protein-protein interactions. 

BASE is a member of the immunoglobulin superfamily, which includes T cell receptors, 
neural cell adhesion molecules and major histocompatibility complex antigens. BASE is a cell 
surface transmembrane glycoprotein that is broadly distributed, and expressed at particularly high 

30 levels on activated gliomas, on tumor cells, on activated T cells and at the retinal pigment 

epithelium and neonatal blood-brain barrier. BASE is involved in cell-cell interactions, and has a 
multiplicity of biological roles. Notably, BASE stimulates the biosynthesis of various matrix 
metalloproteinases (MMPs), a group of enzymes involved in the degradation of most of the 
components of the extracellular matrix. In particular, MMP biosynthesis is crucial in tumor 

35 secretion and in immune response. BASE plays a role in spermatogenesis and fertilization, in 
neuronal interactions in the central nervous system and in HTV-1 infection. 

An embodiment of the present invention relates to methods of using BASE or fragment 
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thereof to stimulate the biosynthesis of metalloproteinases. In a preferred embodiment, 
metalloproteinases produced by such methods can be used in a "cocktail" of proteases that is able 
to digest a wide range of proteins without knowing any of the proteins. Such protease cocktails are 
useful in laboratory assays to degrade undesirable proteins in a sample, for example for removing 
5 proteins in a DNA preparation or for removing enzymes after any enzymatic reaction. In another 
preferred embodiment, metalloproteinases produced by such methods can be used for screening 
and/or assaying metalloproteinases inhibitors. Such metalloproteinase inhibitors are very useful to 
treat and/or prevent a wide range of diseases associated to metalloproteinase activation. In still 
another preferred embodiment, metalloproteinases produced by such methods can be used for 

1 0 degradation of connective tissues, for example in food industry. Any method of stimulating 
metalloproteinases biosynthesis can be used in such methods. For example, fibroblasts can be 
stimulated as described by Guo et al (J Biol Chem 272:24-7 (1997 )), which disclosure is hereby 
incorporated by reference in its entirety. 

An embodiment of the present invention relates to methods of using BASE or fragment 

15 thereof for the diagnosis of cancers, graft rejections, and graft versus host diseases. In such 
methods, BASI2 or fragment thereof is used as a marker to detect and/or quantify cells in which 
BASI2 and/or basigin expression is up-regulated, and in which MMPs are synthetised. Any 
method of detecting the presence, level or activity of BASI2 and/or basigin can be used in such 
methods. For example, the protein of the invention or fragment thereof may be used to generate 

20 specific antibodies using standard methods. Preferably, the antibodies are either directly or 

indirectly labeled, and recognize the second Ig domain. In a preferred embodiment, the antibodies 
bind more specifically to BASE than to related proteins such as basigin. Such antibodies can be 
, used for specifically detecting the presence of the BASE variant. In another preferred 
embodiment, the antibodies recognize both basigin and BASE. Such antibodies can be used for 

25 .detecting total amount of basigin and BASE molecules. Alternatively, the nucleic acid of the 
invention or fragment thereof may be used to synthesize specific probes using any technique 
known to those skilled in the art. In such assays and diagnostic kits, the detection of a higher level 
of BASE and/or basigin expression, compared to a control representative of a non-malignant cell 
coming from a given tissue or bodily fluid, diagnostics the presence of a tumor or the beginning of 

30 a graft rejection reaction. 

Another embodiment of the present invention relates to compositions and methods for 
inhibiting the activity or expression of BASE in a patient for the treatment or prevention of 
disorders caused or aggravated as a result of metalloproteinase biosynthesis. The inhibition of 
. BASE activity or expression can be achieved using any suitable method, e.g. through 

35 administration of a therapeutically effective amount of an antibody that recognizes BASE or 
fragment thereof to a patient. Preferably, the antibodies are either directly or indirectly labeled, 
and recognize the second Ig domain. In a preferred embodiment, the antibodies bind more 
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specifically to BASI2 than to related proteins such as basigin. Such antibodies can be used for 
specifically inhibiting the BASE variant. In another preferred embodiment, the antibody 
recognizes both BASK and basigin. Such antibodies can be used for inhibiting both isoforms. 
The antibody can be administrated alone or in combination with one or more agent known in the 

5 art, e.g. ABX-CBL antibody described in PCT Patent WO99/45031. Administration of the 
antibody can be done following any method known in the art, including that described in PCT 
Patent WO99/4503 1 , which disclosure is hereby incorporated by reference in its entirety. Other 
inhibitors of BASI2 expression or activity which can be used include, but are not limited to, 
antisense molecules, ribozymes, dominant negative forms of BASH, and compounds that decrease 

10 the activity or expression of BASK in a cell. Such compounds can be readily identified, e.g. by 
screening test agents tumor cells overexpressing BASK, and detecting the ability of the test agents 
to decrease metalloproteinase biosynthesis or to diminish the level of BASK expression. Diseases 
and disorders caused or aggravated as a result of MMP biosynthesis and that can be treated by 
administrating an inhibitor of BASK and/or basigin include but are limited to cancers, graft 

15 rejections and graft versus host diseases. 

Still another embodiment relates to compositions and methods for inhibiting the expression 
or activity of BASK in a patient for the treatment or prevention of disorders caused or aggravated 
as a result of microglial activation. The inhibition of BASK activity or expression can be achieved 
using any of the methods described above. Disorders caused or aggravated as a result of microglial . 

20 activation include but are not limited to spinal cord contusion, Huntington disease, dementia with 
Lewy bodies, ischemia, multiple sclerosis, and Alzheimer's disease. 

Still another embodiment relates to compositions and methods for inhibiting the interaction 
between BASK and cyclophilin A (CyPA) in a patient, in order to treat or to reduce in severity 
HIV-infection. Inhibition of the interaction can be achieved using any suitable method, e.g. 

25 through administration of a therapeutically effective amount of an antibody that recognizes BASK 
or fragment thereof to a patient. Preferably, the antibodies are either directly or indirectly labeled, 
and recognize the first Ig domain and/or the second Ig domain. Administration of the antibodies 
can be performed as described above. 

Another embodiment of the present invention relates to compositions and methods for enhancing 
30 the expression or activity of BASK and/or basigin in a patient for the treatment or prevention of 
disorders caused or aggravated as a result of BASK and/or basigin deficiency such as sterility, 
learning and memory impairments, and retinal angiogenesis. Any method or composition 
enhancing the expression or activity of BASK and/or basigin, containing BASK or fragment 
thereof, a polynucleotide encoding the protein, or a compound that increases the expression or 
35 activity of BASK, can be used. Such compounds can be readily identified, e.g. by screening test 
agents against non activated glial cells expressing BASK and detecting the ability of the test agents 
to enhance metalloproteinases biosynthesis, or to increase the level of BASK expression. The 
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compositions of the invention can be administered directly to the patient using any suitable 
method, for example by intravenous perfusion or by oral administration. Effective doses of the 
polypeptides of the present invention are determined according to the relevant techniques. 
Protein of SEQ ID NO:80 (Internal designation 99483_105-016-l-0-D7-F) 
5 The cDNA clone 99483 J05-016-1-0-D7-F (SEQ ID NO:79) encodes KSPI1, the protein 

of SEQ ID NO:80, comprising the amino acid sequence: 
MLPPPRPAAALALPVLLL^ 
AAPRGCLAGRVRDACGCCW^ 
EWEPLCACRSQSPLCGSDGHTYSQO 

10 TWNWGQDVffGCEWAYPMASffiW 

QAVRPSDEGTYRCLGPMPWVKWRPLLA. Accordingly, it will be appreciated that all 
characteristics and uses of the polypeptide of SEQ ID NO:80 described throughout the present 
application also pertain to the polypeptide encoded by the nucleic acids included in clone 
99483_105-016-l-0-D7-F. In addition, it will be appreciated that all characteristics and uses of the 

1 5 nucleic acid of SEQ ID NO:79 described throughout the present application also pertain to the 
nucleic acids included in clone 99483_1 05-01 6-1 -0-D7-F. A preferred embodiment of the 
invention is directed toward the compositions of SEQ ID NO:79, SEQ ID NO:80, and Clone 
99483_105-016-l-0-D7-F. Also preferred are polypeptide fragments having a biological activity 
as described herein and the polynucleotides encoding the fragments.Preferred KSPI1 polypeptides 

20 for uses in the methods described below include the polypeptides comprising the amino sequence 
of: 

CAPCRPEECAAPRGCLAGRVRDACGCCWECANLEGQLCDLDPSAHFYGHCGEQL 

ECRIJDTGGDLSRGEWEPLCACRSQSPLCGSDGHTYSQICRLQEAARARPDANLW 

C and the polypeptide comprising the amino acid sequence of: 
25 WEPLCACRSQSPLCGSDGHTYSQICRLQEAARARPDA^ILWAHPGPC^ 

The protein of SEQ ID NO:80 (KSPI1) is a 267-amino-acid long protein, and is a new 

variant of the bA108L7.1 gene (Genbank accession number AL133215). The 255 first amino- 

acids are identical between the two proteins, but the 12 last amino-acids of KSPI1 are unique. 

KSPI1 displays a signal peptide (MLPPPRPAAALALPVLLLLLVVLTPPPTGA), a kazal-type 
30 serine protease inhibitor (Ki) domain 

(VPEPLCACRSQSPLCGSDGHTYSQICRLQEAARARPDANLWAHPGPQ an 

Immunoglobulin-like (Ig) domain 

(QDVIFGCEWAYPMASIEWRKDGIJ)IQLPGDDPHISVQFRGGPQRF 
DEGTYRCLG) and an Insulin-like growth factor-binding domain 
35 (CAPCRPEECAAPRGCLAGRVRDACGCCWECANLEGQLC). Furthermore, KSPI1 displays 
homologies with many Insulin-like growth factor-binding proteins (IGFBP) from positions 1 to 
255, and highest homology with a well-known IGFBP is obtained with human MAC25. 
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KSPI1 is a new Kazal-type serine protease inhibitor. Protease inhibitors are important 
tools of nature for regulating the proteolytic activity of their target proteases, for blocking these in 
emergency cases, or for signaling receptor interaction or clearance. Kazal-type serine proteases 
inhibitors have been shown to inhibit a number of serine proteases such as trypsin, elastase, 

5 acrosin, and thrombin. As these proteases are involved in major biological processes such as 
haemostasis, inflammation and apoptosis, their inhibitors may have a wide range of therapeutical 
applications (Dahlback, Lancet, 355:1627-32 (2000); Watorek et al., Adv Exp Med Biol, 240:23- 
31 (1988); Martin et al, Cell, 82:349-52 (1995)). 

Moreover, KSPI1 belongs to the low affinity IGFBP family. IGFBPs are soluble proteins 

10 that bind insulin-like growth factors (IGFs). IGFs are involved in the regulation of cellular growth 
and metabolism, and the principal function of IGFBPs is to regulate IGF availability in body fluids. 
Some serine protease inhibitors have been shown to be implicated in the activation of growth 
factors (Kawaguchi et al, J Biol Chem 272:27558-64 (1997)). As KSPI1 is a serine protease that 
belongs to the low affinity IGFBP family, KSP1 binds to IGFs and modulates IGF activation by 

15 inhibiting a serine protease. 

An embodiment of the present invention relates to methods of using KSPI1 or fragment 
thereof to inhibit contaminating proteases in a sample. In particular, KSPI1 can be used in a 
"cocktail 5 ' of protease inhibitors that is able to inhibit a wide range of proteases without knowing 
the specificity of any of the proteases. Such protease inhibitor cocktails are widely used in 

20 laboratory assays to prevent degradation of protein samples by contaminating proteases. 

In another embodiment, KSPI1 or fragment thereof can be used to treat and/or attenuate 
thrombin-mediated and thrombin-associated diseases. Thrombin is a serine protease that regulates 
the last step in the coagulation cascade, and has a central regulatory role in haemostasis and 
thrombus formation. Any compositions and methods containing, e.g., KSPI1 or fragment thereof, 

25 a polynucleotide encoding the protein, or a compound that increases the expression or activity of 
KSPI1 . Such compounds can be readily identified, e.g. by screening test agents against cells 
expressing KSPI1 and detecting the ability of the test agents to increase the level of KSPI1 
expression. A method for determining the ability of the polypeptides of the invention to block the 
proteolytic activity of thrombin is described in U. S. Patent 6,218,365. The compositions of the 

30 invention can be administered directly to the patient using any suitable method, for example by 
intravenous perfusion or by oral administration. The compositions of the invention can also be 
used in extracorporeal circuits, as necessary in dialysis and surgery. Effective doses of the 
polypeptides of the present invention are determined according to the relevant techniques. The 
compositions of the invention may be administered alone or in combination with other known 

35 agents inhibiting proteases of the coagulation cascade. Thrombin-mediated and thrombin- 
associated diseases in which the coagulation cascade is activated include but are not limited to 
deep vein thrombosis, pulmonary embolism, thrombophlebitis, arterial occlusion from thrombosis 
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or embolism, arterial reocclusion during or after angioplasty or thrombolysis, restenosis following 
arterial injury or invasive cardiological procedures, postoperative venous thrombosis or embolism, 
acute or chronic atherosclerosis, stroke, myocardial infarction. 

In another preferred embodiment, KSPI1 or fragment thereof can be used to treat and/or 

5 attenuate diseases associated with neutrophil-released proteases. Activated neutrophils release 
serine proteases such as elastase or cathepsin G, which result in abnormal connective tissue 
turnover and in severe damage to healthy tissues if not properly controlled [Watorek et al., Adv 
Exp Med Biol, 240:23-31 (1988)]. KSPI1 or fragment thereof inhibit neutrophil-released 
proteases, and inhibition efficiency of the polypeptides of the invention can be determined by 

10 measuring in vitro the apparent equilibrium dissociation constants using methods derived for tight- 
binding inhibitors [Bieth, Proteinase Inhibitors. 463-9 (1974); Williams et al, Methods Enzymol. 
63:437-67 (1979)]. The compositions of the invention can be administered directly to the patient 
using any suitable method, for example by intravenous perfusion or by oral administration. 
Effective doses of the polypeptides of the present invention are determined according to the 

15 relevant techniques. The compositions of the invention may be administered alone or in 

combination with other known agents inhibiting neutrophil-released proteases. Diseases caused by 
neutrophil-released proteases include but are not limited to emphysema, idiopathic pulmonary 
fibrosis, adult respiratory distress syndrome, cystic fibrosis, rheumatoid arthritis, organ failure, 
glomerulonephritis and various inflammatory diseases. 

20 Another embodiment of the invention relates to the inhibition and/or attenuation of 

proteases produced by pathogenic microorganisms. This embodiment relates to the administration 
of KSPI1 or fragment thereof, or a compound that increases the expression or activity of KSPI1, 
alone or in combination with other known agents, for preventing and/or treating parasitic infections 
in human, in animals and in cell cultures. It has previously been shown that protease inhibitors can 

25 prevent dissemination of a virus, a protozoa, a bacteria or a fungus in the host organism. Methods 
for determining the ability of the polypeptides of the invention to block the proteolytic activity of 
serine proteases from various pathogenic microorganisms, for preparing and evaluating the 
pharmaceutical compositions, and for administrating the compositions are described in U. S. Patent 
5,739,283, which disclosure is hereby incorporated in its entirety. Accordingly, the polypeptides 

30 of the present invention may be used to prevent or to treat, e.g., coccidiosis, staphylococcal 
infection, infection by the influenza virus, P. gingivalis or T. denticola, and invasive pulmonary 
aspergillosis. 

Another embodiment of the present invention relates to methods of using KSPI1 or 
fragment thereof to remove or to purify serine proteases in a sample. Such methods can be useful 
35 either for removing contaminating proteases from a sample or for purifying a given protease in a 
sample. Preferred polypeptides are KSPI1 in its entirety, polypeptides containing the Ki domain, 
and polypeptides containing the Ki and the Ig domains. Recombinant proteins that display the Ki 
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and/or Ig of KSPI1 may also be used. The binding efficacity of KSPI1 to a given serine protease 
can be tested using any suitable method, e.g., immunoprecipitation and Western blots analysis. 
Any method of binding KSPI1 to the protease and of purifying the complex can be used in such 
methods. In a preferred embodiment, KSPI1 or fragment thereof may be bound to a 
5 chromatographic support, either alone or in combination with other proteases inhibitors, to form an 
affinity chromatography column. The sample to analyse could then be run through this affinity 
chromatography column. 

Another embodiment of the present invention relates to methods of using KSPI1 or 
fragment thereof to detect and/or to quantify the amount of protease in a sample, and thus to use 

10 these methods in assays and diagnostic kits for the quantification of proteases in samples, bodily 
fluids or cell cultures. Such assays can be used to calculate the yield of a serine protease 
purification, and such diagnosis kits, which also contain a sample representative of the amount of 
protease found in a normal subject, can be used to detect diseases and disorders caused as a result 
of protease activity, including those listed above. Preferred polypeptides are KSPI1 in its entirety, 

15 polypeptides containing the Ki domain, and polypeptides containing the Ki and the Ig domains. 
Any method of detecting the protease inhibitor activity of KSPI1 or fragment thereof can be used 
in such methods. For example, the sample is assayed using a standard protease substrate. A 
known concentration of KSPI1 or fragment thereof is added, and allowed to bind to a particular 
protease present. The protease assay is then rerun, and the loss of activity is then correlated to the 

20 protease inhibitor activity using techniques well-known to those skilled in the art. 

Still another embodiment of the invention relates to compositions and methods for 
modulating IGF activity by decreasing binding of KSPI1 to IGFs. Compounds that inhibit the 
interaction of an IGF with any one of its binding proteins and not to a human IGF receptor are 
useful to increase serum and tissue levels of active IGFs in a mammal. Thus, compositions and 

25 methods for decreasing binding of KSPI1 to IGFs can be used, for example, in any treatments 
where IGFs are usually administrated, e.g., treatment of hyperglycemic, obesity-related, 
neurological, cardiac, renal, immunologic, and anabolic disorders. The inhibition and/or reduction 
of binding of KSPI1 to IGFs can be achieved using any suitable method, e.g. through the 
administration of a therapeutically effective amount of an antibody that specifically recognizes 

30 KSPI1 or fragment thereof to a patient. Preferably, two antibodies are used, separately or 
simultaneously, one recognizing the Ig domain and the other recognizing the Ki domain. The 
antibody can be administered alone or in combination with one or more agent known in the art, e.g. 
those described in U.S. Patent 6,251,865, which disclosure is hereby incorporated by reference in 
its entirety. Effective doses of the antibodies of the present invention are determined according to 

35 the relevant techniques. Decreased binding of KSPI1 to IGFs can also be obtained by using 

methods and compounds decreasing KSPI1 expression or activity. Such methods and compounds 
include, but are not limited to, antisense molecules, ribozymes, dominant negative forms of KSPI1 , 
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and compounds that decrease the activity or expression of KSPI1 in a cell. 

Another embodiment of the present invention relates to methods of using KSPI1 or 
fragment thereof to purify IGFs in a sample. Such methods of purifying IGFs can be used to 
analyse the different IGFs present in a patient suffering of one of the diseases listed above. The 
5 binding efficacity of KSPI1 to a given IGF can be tested using any suitable method, e.g., 

immunoprecipitation and Western blots analysis. Any method of binding KSPI1 to IGFs and of 
purifying the complex can be used in such methods, hi a preferred embodiment, KSPI1 or 
fragment thereof may be bound to a chromatographic support, either alone or in combination with 
other IGFBPs, to form an affinity chromatography column. The sample to purify could then be run 

1 0 through this affinity chromatography column. 

In another series of embodiments, KSPI1 or fragment thereof can be used to detect and/or to 
quantify IGFs in a sample. Such methods may then be used in assays and diagnostic kits for the 
quantification of KSPI1-IGF complexes in, e.g., bodily fluids or tissue samples. Any method of 
detecting the presence or level of KSPI1-IGF complexes can be used. In particular, the methods 

15 described by Khosravi et al [Clin Chem 43:523-32 (1997)] and in U.S. Patent 6,248,546, which 
disclosures are hereby incorporated by reference in their entirety, may be adapted, hi a preferred 
embodiment, the diagnosis kit, which contains a sample representative of the level of KSPI1-IGF 
complexes found in a normal subject, can be used to detect diseases caused as a result of impaired 
IGF level, or to monitor the effects of a treatment aiming to increase or decrease IGF level in a 

20 patient. 

Protein of SEQ ID NOS: 82 (Internal designation Clone 517778 J.84-5-3-0-G3-F) 

The cDNA of Clone 517778_184-5-3-0-G3-F (SEQ ID NO:81) encodes the Amyloid 
Apoptotic Receptor (AAR) protein comprising the amino acid sequence: 
MAGGVRPLRGLRALCRVLLFLSQFC3L 
25 SCTY GKPVTFDCAVKPS VTC VDQDFKSQ QLPETD YECTN STSCMTVS 

CPRQRYPANCTVRDHVHCL^ 

QWREGLGKLFSFGGLGIWTLIDVLLIGVGYVGPADGSLYI (SEQ ID NO:82). Accordingly, it 
will be appreciated that all characteristics and uses of the polypeptides of SEQ ID NO:82 described 
throughout the present application also pertain to the polypeptides encoded by the nucleic acids 

30 included in Clone 5 17778_1 84-5-3-0-G3-F. In addition, it will be appreciated that all ■ 

characteristics and uses of the polynucleotides of SEQ ID NO:81 described throughout the present 
application also pertain to the nucleic acids included in Clone 517778_184-5-3-0-G3-F. A 
preferred embodiment of the invention is directed toward the compositions of SEQ ID NO:81, 
SEQ ID NO:82, and Clone 5 17778JI 84-5-3-0-G3-F. Also preferred are polypeptide fragments 

35 having a biological activity as described herein and the polynucleotides encoding the fragments. 
AAR is a 221 amino acid receptor with two transmembrane segments. The resulting 
protein has a hydrophilic intracellular loop and extracellular amino- and carboxy-terminal ends. 
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The transmembrane domains and intracellular loop of AAR are very similar to the third and forth 
transmembrane domains and intervening sequence of seven transmembrane G protein coupled 
receptors. The G protein-binding amino acid sequence DRF is found near the first transmembrane 
region of AAR. The extracellular portion of AAR binds to ligands that include amyloidgenic 
5 peptides. Ligand binding leads to apoptosis for the AAR-expressing cell. AAR has biological 
activities that comprise binding G protein components and ligands such as amyloidgenic peptides. 
Preferred embodiments of the invention include: 

A method of preventing cell death wherein a ligand-binding polypeptide fragment of AAR 
is contacted with ligand in an amount effective to competitively inhibit ligand binding to AAR 

10 expressed on a cell. 

Preferred polypeptide fragments of AAR include but are not limited to those starting at an 
amino acid chosen from amino acids 1- 40 and ending at amino acid chosen from amino acids 165- 
180. The most preferred polypeptide fragment comprises amino acids 1-180 of AAR. 

Preferred forms of inhibited cell death include those associated with amyloidgenic 

15 peptides. A method of inducing apoptotic cell death wherein an AAR ligand is contacted with a 
cell in an amount effective to induce apoptosis of the cell. Preferred AAR ligands include 
amyloidgenic peptides. Further preferred AAR ligands are compounds that bind specifically to 
AAR and cause apoptosis in the cell expressing AAR. Further preferred AAR ligands include 
AAR-specific antibodies. Preferred AAR-specific antibodies include those that bind an epitope 

20 within the amino-terminal extracellular region of AAR. Preferred cells to be contacted with AAR 
ligand include neoplastic cells. Further preferred cells include neoplastic cells that express AAR. 

A preferred solution of AAR ligand further comprises a hydrogel-forming polymer 
solution to improve localization of delivery. A preferred solution of AAR ligand further comprises 
one or more alkaline salts to improve ligand.binding. A preferred solution of AAR ligand further 

25 comprises one or more chemotherapeutic agents to improve efficacy of treatment. A preferred 
method of contacting a cell includes catheter injection. 

A preferred method of contacting a cell further includes tumor imaging to improve 
accuracy of localized delivery. A preferred method of contacting a cell further includes computer 
modeling and administration of AAR ligand to improve accuracy of localized delivery. 

30 The amino-terminus of AAR is capable of binding to ligands such as amyloidgenic 

peptides (i.e., the P-amyloid peptide associated with Alzheimer's disease, Amyloid Precursor Like 
Proteins (APLP) 1 and 2, immunoglobulin light chain, prealbumin, P-2-microglobulin, 
transthyretin, amylin, insulin, atrial natriuretic peptide (ANP), apolipoproteins and glucagon). The 
amyloidgenic fragments of these proteins form predominantly beta-pleated sheet structures that 

35 may adopt the fibrillar configuration of amyloid in certain pathologic states. Amyloid deposits 
often lead to cell death in affected tissues. Amyloid-associated disorders include, most notably, 
Alzheimer's disease, diabetes, systemic amyloidosis, familial visceral amyloidosis, cutaneous 
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amyloidosis, Muckle-Wells syndrome, Gerstman-Straussler disease, dialysis-related and 
hemodialysis-related amyloidosis. Amyloid deposits may lead to further pathogenic outcomes 
depending on the affected tissue. For instance, hemodialysis-related amyloidosis can result in 
carpal tunnel syndrome, erosive arthropathy, spondyloarthropathy, lytic bone lesions, and 
5 pathologic fractures. P-amyloid peptide deposition in the tunica media of leptomeningeal and 
parenchymal vessels causes degradation of smooth muscle cells and subsequent cortical 
hemorrhages. Furthermore, the neuronal cell death observed in Alzheimer's disease is associated 
with the senility that accompanies the later stages of the disease and pancreatic Dp-islet cell death 
is a causative factor of disrupted insulin regulation in diabetes. Reducing the level of 

10 amyloidgenic peptides is a desired therapy for disorders such as those listed herein. 

In a preferred embodiment of the invention, a ligand-binding polypeptide fragment of 
AAR is used to prevent cell death. This method comprises the step of: contacting a ligand-binding 
fragment of AAR with ligand in an amount effective to competitively inhibit binding of ligand to 
AAR expressed on a cell. Preferred polypeptide fragments of AAR include but are not limited to 

15 those starting at an amino acid chosen from amino acids 1-40 and ending at an amino acid chosen 
from amino acids 165-180. Any single AAR fragment or combination of AAR fragments included 
in said list may be excluded from this embodiment of the invention. The most preferred fragment 
comprises amino acids 1-180 of AAR. Preferred forms of inhibited cell death include those 
associated with amyloidgenic peptides, such as pancreatic P-islet cell death and others listed 

20 herein. AAR fragments may be applied by methods common to the art such as those discussed 
herein. For example, AAR fragments may be delivered to cells of the pancreas in physiologically 
acceptable form by direct injection or catheter. For prolonged treatment, AAR fragments may be 
released from an implantable polypeptide-releasing stent (U.S. Patent 5683345 and U.S. Patent 
5500013, which disclosures are hereby incorporated by reference in their entireties). 

25 In the absence of ligand, AAR expression protects a cell from apoptotic cell death. AAR is 

expressed in many different cell types, including leukocytes and cells of the heart, brain, placenta, 
ovaries, testes, lung, liver, muscle, kidney, pancreas, colon, intestine, and prostate. Therefore, 
AAR may be exploited to cause cell death by addition of ligand. This inducible cell death is useful 
for treating neoplastic cell growth in a number of different tissues. As a preferred embodiment of 

30 the invention, an AAR ligand is used in a method to promote apoptotic cell death. This method 
comprises the step of contacting an AAR ligand with a cell in an amount effective to induce 
apoptotic cell death. Preferred AAR ligands include but are not limited to those listed herein (i.e., 
amyloidgenic peptides). Any single amyloidgenic peptide ligand or combination of amyloidgenic 
peptide ligands may be excluded from this embodiment of the invention. Further preferred AAR 

35 ligands are compounds that bind specifically to AAR and cause apoptosis in the cell expressing 
AAR, such as an AAR-specific antibody. Preferred antibodies for use in this method include those 
that bind an epitope within the amino-terminal extracellular region of AAR. Any single antibody 



271 



WO 02/094864 



PCT/ffiOl/01715 



or combination of antibodies that bind to an epitope of AAR may be excluded from this 
embodiment of the invention. Preferred cells to be contacted with AAR ligand include neoplastic 
cells including but not limited to: neoplastic leukocytes and neoplastic cells of the heart, brain, 
placenta, ovaries, testes, lung, liver, muscle, kidney, pancreas, colon, intestine, and prostate. 

5 Further preferred cells include those that express AAR. 

Delivery of AAR ligand to specific cells may be accomplished by methods common to the art such 
as those discussed herein. For example, an effective amount of AAR ligand in physiologically 
acceptable solution may be injected locally by syringe or catheter into a tumor mass to promote 
apoptotic cell death. AAR ligand may be used as the sole active agent in the solution, or may be 

1 0 used in combination with other chemotherapeutic drugs to increase the efficacy of treatment. A 
problem with direct delivery of AAR ligand into a solid tumor may be resistance of the tissue to 
the influx of the fluid. Increased penetration and/or reduced backflow through the point of entry, 
so that more material is introduced into and remains in the tumor, is obtained through the use of a 
viscous vehicle for the AAR ligand. Preferred materials include solutions or suspensions of a 

15 polymeric material which form a hydrogel at the time of or shortly after injection or implantation. 
The hydrogel solution of AAR ligand is injected via a catheter into regions of the tumor to be 
treated as described in U.S. Patent 5945100, which disclosure is hereby incorporated by reference 
in its entirety. Another problem with direct delivery of AAR ligand is that cancerous tumors 
generate localized areas of relatively high acidity due to a metabolic process known as "anaerobic 

20 glycolysis." This acidic environment may interfere with ligand binding to AAR. A 

physiologically acceptable solution of AAR ligand may therefore include a variety of alkaline 
salts, as described in U.S. Patent 5681857, which disclosure is hereby incorporated by reference in 
its entirety. To further increase the accuracy of treatment, tumor imaging, alone or in combination 
with computer modeling and administration of the AAR ligand solution may be employed (U.S. 

25 Patent 5438989 and U.S. Patent 5823993, which disclosures are hereby incorporated by reference 
in their entireties). 

Proteins of SEQ ID NOs:84, 86, and 98 (Internal designation Clones 100038_105-017-4-0-E4- 
F, 100523_105-019-l-0-F3-F, and 100545J05-019-2-0-E3-F) 

The cDNAs of Clones 100038J05-017-4-0-E4-F and 100523 J05-019-1-0-F3-F (SEQ ID 
30 NOs:83 and 85, respectively) encode the Soluble Activator of Wnt (SAW)-1 protein comprising 
the amino acid sequence: 
MLPPLPSRLGLLLLLLLCPAHV 
WAELARGARLGVRECQFQFRFRRWNCSS 

SRRRFQVPGPS (SEQ ID NOs;84 and 86). The cDNA of Clone 100545_105-019-2-0-E3-F 
35 (SEQ ID NO: 97) encodes the SAW-2 protein comprising the amino acid sequence: 
MIJTIJSRLGIXLLI^ 

WAELARGARLGVRECQFQFRFRRWNCSSHSKAFGI^ 
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GEENWFAEVA (SEQ ID NO:98). Accordingly, it will be appreciated that all characteristics and 
uses of the polypeptides of SEQ ED NOs:84, 86, and 98 described throughout the present 
application also pertain to the polypeptides encoded by the nucleic acids included in Clones 
100038J05-017-4-0-E4-F, 100523 J05-019-1-0-F3-F, and 100545_105-019-2-0-E3-F, 
5 respectively. In addition, it will be appreciated that all characteristics and uses of the 

polynucleotides of SEQ ID NOs:83, 85, and 97 described throughout the present application also 
pertain to the nucleic acids included in Clones 100038_105-017-4-0-E4-F, 100523_105-019-1-0- 
F3-F, and 100545_105-019-2-0-E3-F, respectively. A preferred embodiment of the invention is 
directed toward the compositions of SEQ ID NO:83, SEQ ID NO:84, SEQ ID NO:85, SEQ ID 

10 NO:86, SEQ ID NO:97, SEQ ID NO:98, Clone 100038 J05-017-4-0-E4-F, Clone 100523 J05- 
019-1-0-F3-F, and Clone 100545_J 05-019-2-0-E3-F. Also preferred are fragments having a 
biological activity described herein and the polynucleotides encoding the fragments. A preferred 
fragment of the polypeptides of SEQ ID NOs:84 and 86 comprises: 
MLPPLPSRLGLLLLLLLCPAH^ 

15 WAELARGARLGVRECQFQFRFRRWNCSSHSKAFGRILQQGQCGEG A 
preferred fragment of the polypeptides of SEQ ID NO: 98 comprises: 
MLPPLPSRLGLLLLLLLCPA 

WAELARGAIU1GVRECQFQF A 
further preferred fragment of the polypeptide sequences of SEQ ID NOs:84, 86, and 98 comprises: 
20 MHTLPSRLGLLLLLLL^ 

WAELARGARLGVRECQFQFRFRRWNCSSHSKAFGRILQQGQ. 
A list of preferred embodiments of the invention follows. 

A preferred embodiment is a composition, comprising a S AW-1 polypeptide sequence of 
SEQIDNO:84. 

25 A preferred embodiment is a composition, comprising a SAW-1 polypeptide sequence of 

SEQIDNO:86. 

A preferred embodiment is a composition, comprising a SAW-1 polypeptide fragment 
having biological activity. 

A preferred embodiment is a composition, comprising a SAW-2 polypeptide sequence of 
30 SEQIDNO:98. 

A preferred embodiment is a composition, comprising a SAW-2 polypeptide fragment 
having biological activity. 

A preferred embodiment is a composition, comprising a polynucleotide sequence of SEQ 
ID NO:83 encoding a SAW-1 polypeptide. 
35 A preferred embodiment is a composition, comprising a polynucleotide sequence of SEQ 

ID NO:85 encoding a SAW-1 polypeptide. 

A preferred embodiment is a composition, comprising a polynucleotide sequence encoding 
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a biologically active SAW-1 polypeptide fragment. 

A preferred embodiment is a composition, comprising a polynucleotide sequence of SEQ 
ID NO:97 encoding a SAW-2 polypeptide. 

A preferred embodiment is a composition, comprising a polynucleotide sequence encoding 
5 a biologically active SAW-2 polypeptide fragment 

A preferred embodiment is a method of increasing Wnt-dependent signaling to facilitate 
stem cell growth comprising the step of: contacting a SAW-1 or SAW-2 polypeptide or 
biologically active fragment thereof with a stem cell. 

Preferred stem cells include those capable of growth or proliferation in response to Wnt. 
10 Further preferred stem cells include those capable of giving rise to hematopoetic cells. 

Further preferred stem cells include those capable of giving rise to neuronal or neuroglial 

cells. 

Further preferred stem cells include those capable of giving rise to hepatocytes. 
Further preferred stem cells include those capable of giving rise to pancreatic cells. 
1 5 Further preferred stem cells include osteoblasts. 

Further preferred stem cells include chondroblasts. 

Further preferred stem cells include those found in cord blood. 

Also preferred is the addition of one or more cell-type specific growth factor to the stem 
cell before, during or subsequent to contact with SAW-1 or SAW-2 polypeptide. 
20 A preferred embodiment is a method of increasing Wnt-dependent signaling to prevent 

apoptosis comprising the step of: contacting a SAW-1 or SAW-2 polypeptide or biologically active 
fragment thereof with a cell at risk of apoptosis. 

Preferred cells are those capable of responding to Wnt. 
Preferably, the method is applied to prevent apoptosis of cells in culture. 
25 Preferably, the method is applied to treat an apoptosis-related disorder. 

Preferably, the method is applied to prevent an apoptosis-related disorder. 
A preferred apoptosis-related disorder is chosen from the list consisting of: 
neurodegenerative diseases, Spinal Muscular Atrophy (SMA) types I-IU, Amyltrophic Lateral 
Sclerosis (ALS), Huntington's disease, Alzheimer's disease, Parkinsons disease, retinal 
30 degeneration, retinitis pigmentosa, cerebellar degeneration, myelodysplasis, aplastic anemia, 
. ischemia-related degeneration, myocardial infarction, stroke, hepatic degeneration diseases, 
alcoholic hepatitis, hepatitis B, hepatitis C, fulminant hepatitis, joint degeneration diseases, 
osteoarthritis, and diabetes. 

SAW-1 and SAW-2 are splice variants of the Wnt-6 gene. In the case of SAW-1, the 135- 
35 nucleotide cassette inserted into the Wnt-6 cDNA encodes an early termination codon. The 

resulting SAW-1 polypeptide is 131 amino acids in length, compared to the 365-amino acid Wnt-6 
protein. In the case of SAW-2, a 236-nucleotide insertion also encodes for an early termination 
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codon. SAW-2 polypeptide is 129 amino acids in length and possesses a biological activity 
identical to that of SAW-1 . The Wnt family of proteins is crucial for determining cell polarity and 
fate, patterning of a number of tissues in the developing embryo, cell proliferation, and 
maintenance of stem cell populations throughout life. The role of Wnt proteins in promoting cell 
5 survival may explain the prevalence of Wnt overexpression in human cancers. Wnt proteins are 
secreted factors that generally associate with the extracellular matrix or cell surface. Receptors for 
Wnt proteins include the Frizzled (Fz) family of seven transmembrane spanning receptors and the 
low-density lipoprotein receptor-related proteins (LRP) 5 and 6. These receptors can act 
synergistically as Wnt coreceptors to transmit signals and upregulate target gene expression. 

10 Inhibitors of Wnt signaling include a soluble form of the Fz receptor, which acts as a competitive 
dominant negative inhibitor, and the extracellular factors Cerberus and Wnt-Inhibitory Factors 
(WIFs). Therefore, Wnt proteins are targets for multiple protein-protein interactions. SAW-1 and 
SAW-2 are novel, truncated splice variants of Wnt-6 that interact with Cerberus and WIF proteins. 
The biological activities of SAW-1 and SAW-2 are defined by those interactions. 

15 Wnt proteins are important in maintaining stem cell populations throughout adulthood. 

Stem cells comprise an undifferentiated or partially undifferentiated self-renewing population. As 
used herein, "stem cell" refers to any cell that retains undifferentiated character, is capable of self- 
renewal, and that gives rise to a further differentiated cell. These cells are important for renewing 
cell populations of nearly every type, especially the high-turnover populations of epithelial linings, . 

20 dermal layers, and the reproductive and hematopoetic systems. Defects in stem cell populations or .. 
drastic cell loss, whether caused by genetic predisposition, trauma, injury, disease, or medical 
treatments such as chemotherapy, have a disastrous effect on an individual. These defects may be 
overcome by stimulating growth of the remaining stem cell population in vivo. Alternatively, in 
vitro culture and transplantation of stem cells, preferably derived from the individual in need of 

25 treatment, but also from other sources such as cord blood, may be effective. Mature cells derived 
from the cultured stem cells may be transplanted as well. As Wnt proteins are effective growth 
and survival factors for stem cells, these proteins are useful for either strategy of cell replacement. 
However, Wnt proteins are difficult to purify in soluble form and do not diffuse readily, making 
Wnt-based treatments difficult to execute. A preferred method of increasing Wnt signaling is to 

30 decrease interaction of Wnt with soluble inhibitors such as Cerberus and WIF. 

In a preferred embodiment of the invention, a SAW-1 or SAW-2 polypeptide or 
biologically active fragment thereof is used to increase Wnt-dependent signaling and facilitate 
stem cell growth. This method comprises the step of contacting a SAW-1 or SAW-2 polypeptide 
or biologically active fragment thereof wife a stem cell. Preferred stem cells include those capable 

35 of growth or proliferation in response to Wnt Also preferred is the addition of one or more cell- 
type specific growth factors or cytokines in combination with SAW-1 or SAW-2 polypeptide. 
Examples include the interleukins (e.g., IL-3), granulocyte-macrophage colony-stimulating factor 
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(GM-CSF), macrophage colony-stimulating factor (M-CSF), granulocyte colony-stimulating factor 
(G-CSF), erythropoietin (Epo), lymphotoxin, steel factor (SLF), tumor necrosis factor (TNF) and 
gamma-interferon. IL-3 acts on multipotent stem cells as well as progenitors restricted to the 
granulocyte, macrophage, eosinophil, megakaryocyte, erythroid or mast cell lineages, while Epo 
5 acts on fairly mature erythroid progenitor cells. SAW-1 or SAW-2 polypeptide or a biologically 
active fragment thereof may be used to facilitate stem cell proliferation in culture by adding a 
physiologically acceptable solution comprising said polypeptide to a stem cell in culture (e.g., liver 
stem cells, neural or neuroglial stem cells, osteoblasts, chondroblasts, pancreatic stem cells, 
hematopoetic stem cells, cord blood, etc.) in an amount effective to promote Wnt-dependent 

10 growth or proliferation. A physiologically acceptable solution comprising a SAW-1 or SAW-2 
polypeptide or a biologically active fragment thereof may further be added upon transplantation or 
reintroduction of cultured cells into an individual to provide additional growth potential for the 
cells in vivo. Cell transplantation and reintroduction methods are determined by one skilled in the 
art and include injection of a single-cell suspension by syringe or catheter and surgical 

15 implantation (also see U.S. Patent 5869463 for neuroglial cell transplants; U.S. Patent 6068836 for 
bone marrow transplants; Noel et al., Metabolism, 31:1 84-7 (1 982) for pancreatic cell transplants; 
and U.S. Patent 4950296, U.S. Patent 5385566, and U.S. Patent 6200324 for bone transplants, 
which disclosures are hereby incorporated by reference in their entireties). Additionally this 
method may be applied to increase Wnt-dependent stem cell growth and proliferation in vivo. For 

20 example, a physiologically acceptable solution comprising SAW-1 or SAW-2 polypeptide or a 
biologically active fragment thereof may be directly injected to the site of interest (e.g., the bone 
marrow for hematopoetic stem cell treatment) or by other methods common to the art. 

Given that stem cells are, at the earliest stage, able to differentiate into almost any kind of 
mature, functional cell, a wide variety of conditions may be addressed by stem cell treatment. As 

25 an example, hematopoetic stem cell growth or replacement may benefit those predisposed to or 
suffering from, any one or more of the following exemplary conditions: lymphocytopenia; 
lymphorrhea; lymphostasis; immunodeficiency (e.g., HIV and AIDS); infections (including, for 
. example, opportunistic infections and tuberculosis (TB)); lupus; disorders characterized by 
lymphocyte deficiency, erythrocytopenia; erthrodegenerative disorders; erythroblastopenia; 

30 leukoerythroblastosis; erythroclasis; thalassemia; anemia (e.g., hemolytic anemia, such as 

acquired, autoimmune, or microangiopathichemolytic anemia; aplastic anemia; congenital anemia, 
e.g., congenital dyserythropoietic anemia, congenital hemolytic anemia or congenital hypoplastic 
anemia; dyshemopoietic anemia; Faconi's anemia; genetic anemia; hemorrhagic anemia; 
hyperchromic or hypochromic anemia; nutritional, hypoferric, or iron deficiency anemia; 

35 hypoplastic anemia; infectious anemia; lead anemia; local anemia; macrocytic or microcytic 
anemia; malignant or pernicious anemia; megaloblastic anemia; molecular anemia; normocytic 
anemia; physiologic anemia; traumatic or posthemorrhagic anemia; refractory anemia; radiation 
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anemia; sickle cell anemia; splenic anemia; and toxic anemia); myelofibrosis; thrombocytopenia; 
hypoplasia; disseminated intravascularcoagulation (DIC); immune (autoimmune) 
thrombocytopenic purpura (TTP); HIV inducted ITP; myelodysplasia; thrombocytotic diseases and 
thrombocytosis. Stem cells giving rise to neural or neuroglial cells may be applied to treat 
5 disorders including but not limited to: Alzheimer's disease, frontotemporal dementia, bipolar 
disorder, Huntington's chorea, multiple sclerosis, amyotrophic lateral sclerosis, Tay-Sach's 
disease, Gaucher's disease, and dopamine-related disorders such as Parkinson's disease and 
schizophrenia. Pancreatic stem cell cultures may be applied to treatment of metabolic disorders 
such as diabetes. Stem cells from bone tissue (osteoblasts) may be used to treat to bone loss, 
10 atrophy, or malformation due to injury, congenital or chronic conditions, osteopenia, osteoporosis, 
• . rickets, malignant melanoma-induced bone degradation, and bone fissures or fractures due to 
injury, elective surgery (e.g., plastic surgery), reconstructive surgery, and dental procedures or 
surgeries. 

Wnt proteins act to inhibit apoptosis and promote survival of Wnt-responsive cells. A 

15 specific activator of Wnt signaling is desirable both for cell or tissue growth in vitro and for 
. treating apoptosis-related disorders in vivo. Examples of such disorders include: 
neurodegenerative diseases such as Spinal Muscular Atrophy (SMA) types HE, Amyltrophic 
Lateral Sclerosis (ALS), Alzheimer's disease, Huntington's disease, Parkinson's disease, retinal 
degeneration, retinitis pigmentosa and cerebellar degeneration; myelodysplasis such as aplastic 

20 anemia; ischemic diseases such as myocardial infarction and stroke; hepatic diseases such as 
alcoholic hepatitis, hepatitis B, hepatitis C, and fulminant hepatitis; joint-diseases such as 
osteoarthritis; and metabolic disorders such as diabetes. In a preferred embodiment of the 
invention, a SAW-1 or SAW-2 polypeptide or biologically active fragment thereof is used to 
prevent apoptosis-related degeneration. This method comprises the step of contacting a SAW-1 or 

25 SAW-2 polypeptide or biologically active fragment thereof with a cell. Preferred cells are those 
capable of responding to Wnt Further preferred cells are those at risk of apoptosis. For example, 
a physiologically acceptable composition comprising SAW-1 or SAW-2 polypeptides may be 
added to a mixed culture of hippocampal neurons to improve cell survival in culture. 
Alternatively, a physiologically acceptable composition comprising SAW-1 or SAW-2 polypeptide 

30 or biologically active fragment thereof may be delivered to an individual diagnosed with or at risk 
of an apoptosis-related disorder, as determined by one skilled in the art. SAW-1 or SAW-2 
polypeptide may be used alone or in combination with agents that modulate Wnt signaling, 
apoptosis, or cell type-specific processes. Furthermore, SAW-1 or SAW-2 polypeptide may be 
fused to a ligand for the purpose of stabilizing and/or targeting said polypeptide (for example, 

35 tetanus toxin, calcium channel blocking agents, transferrin, poliovirus epitopes, neuropeptide 
fragments, or steroid hormone androgens, or fragments thereof which are sufficient for neuronal 
targeting). As an example, a physiologically acceptable composition comprising SAW-1 or SAW- 
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2 polypeptides may be delivered to an individual to prevent osteoartbritis-associated joint 
degeneration. As an additional example, said composition may be administered to an individual 
that has or is likely to experience an ischemic event. Appropriate delivery methods, such as those 
discussed herein, may be determined on a case by case basis by one skilled in the art. 
5 Protein of SEQ ID NOs: 88 and 90 (Internal designation Clone 116470_105-063-3-0-H7-F 
and Clone 122600_105-077-3-0-F9-F) 

The cDNAs of Clone 1 16470J05-063-3-0-H7-F and Clone 122600_105-077-3-0-F9-F 
(SEQ ID NOs: 87 and 89, respectively) encode the Dopamine AMPhetamine INhibitor (Dampin) 
protein comprising the amino acid sequence: 

10 MLFRLSEHSSPEEEASPHQRASGEGHHLKSKRP 

NQASEEEDELGELRELGYPREEDEEEEEDDEEEEEEEDSQAEVIXVIRQSAGQKTTCGQGL 
EGPWERPPPLDESERDGGSEDQVEDPALSEPGEEPQRPSPSEPGT (SEQ ID NOs:88 and 90). 
Accordingly, it will be appreciated that all characteristics and uses of the polypeptides of SEQ ID 
NOs:88 and 90 described throughout the present application also pertain to the polypeptides 

.1 5 encoded by the nucleic acids included in Clone 1 16470_105-063-3-0-H7-F and Clone 

122600_1 05-077-3-0-F9-F, respectively. In addition, it will be appreciated that all characteristics 
and uses of the polynucleotides of SEQ ID NOs:87 and 89 described throughout the present 
application also pertain to the nucleic acids included in Clone 1 16470_105-063-3-0-H7-F and 
Clone 122600_J05-077-3-0-F9-F, respectively. A preferred embodiment of the invention is 

20 directed toward the compositions of SEQ ID NO:87, SEQ ID NO:88, SEQ ID NO:89, SEQ ID 
NO:90, Clone 11 6470_1 05-063 -3 -0-H7-F and Clone 122600__105-077-3~0-F9-F. Also preferred 
are polypeptide fragments having a biological activity as described herein and the polynucleotides 
encoding the fragments. 

Dampin is a splice variant of the Dopamine and cAMP-Regulated PhosphoProtein-32 

25 (DARPP-32) that utilizes a different translation start site and lacks the first 37 amino acids of 
DARPP-32. DARPP-32 is a cytoplasmic signaling molecule that is regulated by phosphorylation 
at residues T34 by Protein Kinase A (PKA) to function as an inhibitor of Protein Phosphatase 1 
(PP1). This increases the effect of PKA on downstream targets. In neurons, PKA phosphorylates 
DARPP-32 in response to dopamine or psychoactive drugs that act on dopamine signaling 

30 pathways (e.g., cocaine and amphetamines). Alternatively, phosphorylation of T75 by Cdk5 
results in DARPP-32 inhibition of PKA. Dampin, as a splice variant, is not phosphorylated in 
response to PKA signaling and does not act as an inhibitor of PP1. However, Dampin has a Cdk5 
phosphorylation site and is able to inhibit PKA signaling. 

Abnormal signaling through dopaminergic pathways has been implicated in several major 

35 neurological and psychiatric disorders, including Parkinson's disease, Tourette's syndrome, 

Attention Deficit Disorder (ADD), Huntington's disease, schizophrenia, and drug/ alcohol abuse. 
In particular, cocaine and amphetamines activate the dopaminergic pathways through PKA. 
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Furthermore, addictive behavior is associated with increased dopaminergic signaling and PKA 
activity. Therefore, diminished PKA activity may be desired to address addictive behavior and 
drug and alcohol abuse. Increases in dopamine responses may be desired to treat disorders such as 
Parkinson's disease, Tourette's syndrome, ADD, Huntington's disease, and schizophrenia. 
5 Progesterone, similar to dopamine, also activates PKA, which leads to DARPP-32 

phosphorylation at T34 and inhibition of PP1. Dampin inhibits both dopamine and progesterone 
signaling by attenuating PKA activity. Progesterone is required for ovulation and implantation of a 
fertilized egg in the uterine wall. In addition, progesterone, in combination with dopamine 
increases female sexual receptivity. Therefore, high levels of Dampin relative to DARPP-32 

10 would be effective for female birth control as well as behavioral modification (e.g., for purposes of 
animal training). Alternatively, high levels of DARPP-32 relative to Dampin would be effective 
for increasing female fertility and sexual receptivity. 

PP1 activates glycogen synthase in response to insulin. Glycogen synthesis is one 
mechanism by which blood glucose levels are regulated by insulin. DARPP-32 inhibition of PP1 

15 is in turn inhibited by insulin. However, insufficient insulin or insulin resistance may lead to 

inappropriate inhibition of PP1 and dysregulation of blood glucose levels. Such dysregulation may 
result from disorders that include: Noninsulin dependent diabetes mellitus (MIDDM), Insulin 
dependent diabetes mellitus (IDDM), insulin resistance and insulin resistant disorders such as 
acanthosis nigricans, leprechaunism, and lipoatropahy. As Dampin does not inhibit PP1, high 

20 levels of Dampin relative to DARPP-32 would be effective for modulating blood glucose levels by 
increasing glycogen synthase activity. 

Preferred embodiments of the invention include: 

A composition comprising a Dampin polypeptide sequence of SEQ ID NOs:88 and 90. A 
composition comprising a Dampin polypeptide fragment having biological activity. A 
25 composition comprising a polynucleotide sequence of SEQ ID NOs:87 and 89 encoding a Dampin 
polypeptide. A composition comprising a polynucleotide sequence encoding a Dampin 
polypeptide fragment having biological activity. 

A method of screening test substances for modulators of Dampin expression comprising 
the steps of: i) contacting a cell with a test substance; and ii) comparing Dampin expression in the 
30 cell after exposure to the test substance to that of an unexposed control cell. 

A method of screening for test substances that modify the ratio of DARPP-32 relative to 
Dampin comprising the steps of: i) contacting a cell with a test substance; ii) comparing Dampin 
expression in the cell after exposure to the test substance to that of an unexposed control cell; 
iii) comparing DARPP-32 expression in the cell after exposure to that of an unexposed control cell; 
35 iv) quantifying said expression levels; and v) determining the level of DARPP-32 relative to 
Dampin in the exposed and unexposed cells. 

Preferably, the test substance modifies the ratio of Dampin relative to DARPP-32 in a 
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specific cell type while not in others. Further preferably, the test substance is conjugated to a cell 
type-specific ligand. Preferably, the method screens for test substances that decrease the ratio of 
DARPP-32 relative to Dampin. 

Alternatively, the method screens for test substances that increase the ratio of DARPP-32 
5 relative to Dampin. 

A method of differentiating Dampin polypeptides from DARPP-32 polypeptides 
comprising the steps of: i) contacting a first antibody that binds specifically to DARPP-32 and not 
Dampin with a protein sample; ii) contacting a second antibody that binds specifically to both 
DARPP-32 and Dampin with a protein sample; and iii) detecting protein-bound antibody. 
10 Preferably, the first and second antibodies are labeled with a different detectable conjugate. 
Preferably, the method follows immunohistochemical protocols. 

A substance that decreases the ratio of DARPP-32 relative to Dampin made by the process 
comprising the steps of: i) contacting a cell with a test substance; ii) comparing Dampin 
expression in the cell after exposure to the test substance to that of an unexposed control cell; 
15 iii) comparing DARPP-32 expression in the cell after exposure to that of an unexposed control cell; 
iv) quantifying said expression levels; v) determining the level of DARPP-32 relative to Dampin in 
the exposed and unexposed cells. 

Preferably, the substance decreases the ratio of DARPP-32 relative to Dampin in a specific 
cell type while not in others. Further preferably, the substance is contained in a physiologically 
20 acceptable composition. 

A substance that increases the ratio of DARPP-32 relative to Dampin made by the process 
comprising the steps of: i) contacting a cell with a test substance; ii) comparing Dampin 
expression in the cell after exposure to the test substance to that of an unexposed control cell; 
iii) comparing DARPP-32 expression in the cell after exposure to that of an unexposed control cell; 
25 iv) quantifying said expression levels; v) determining the level of DARPP-32 relative to Dampin in 
the exposed and unexposed cells. 

Preferably, the substance increases the ratio of DARPP-32 relative to Dampin in a specific 
cell type while not in others. Further preferably, the substance is contained in a physiologically 
acceptable composition. 

30 A method of screening for test substances that specifically bind to Dampin and prevent 

binding to PKA comprising the steps of: i) contacting a test substance with Dampin polypeptide in 
the presence of PKA, under conditions that allow binding of Dampin to PKA and ii) detecting the 
amount of PKA bound to Dampin in the presence and absence of the test substance. 

Preferably, the test substance is able to inhibit Dampin interaction with PKA in a certain 
35 cell type and not in others. Further preferable are test substances conjugated to cell-type specific 
Iigands or portions thereof. 

A substance that specifically binds to Dampin and prevents binding to PKA made by the 
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process comprising the steps of: i) contacting a test substance with Dampin polypeptide in the 
presence of PKA, under conditions that allow binding of Dampin to PKA and ii) detecting the 
amount of PKA bound to Dampin in the presence and absence of the test substance by methods 
common to the art. 

5 A method of decreasing PKA activity in a neuron comprising the step of contacting a 

substance capable of increasing the ratio of Dampin to DARPP-32 with a neuron. Preferably, the 
substance is capable of passing through the blood brain barrier. Preferably, this method is used to 
decrease cocaine- or amphetamine-dependent responses. Preferably, this method is used to 
diminish addictive behavior. Further preferably, this method is used to diminish alcohol addiction. 

10 A method of decreasing PKA activity in a cell of the female reproductive tract comprising 

the step of contacting a substance capable of increasing the ratio of Dampin to DARPP-32 with a 
cell of the female reproductive tract. Preferred cells include ovarian granulosa cells and luteal cells 
of the uterus. Preferably, this method is used to inhibit progesterone-dependent ovulation and 
implantation of a fertilized egg. Preferably, this method is used for female birth control. 

15 A method of modulating blood glucose levels comprising the step of contacting a 

substance capable of increasing the ratio of Dampin to DARPP-32 with a glycogen-storing cell. 
Preferred glycogen-storing cells include myocytes and hepatocytes. 

A method of inhibiting PKA activity comprising the step of introducing a Dampin 
polypeptide into a cell. Preferably, Dampin polypeptide is delivered to a cell by introducing a 

20 polynucleotide encoding Dampin polypeptide into the cell. Preferably, the polynucleotide is a 
polynucleotide construct comprising an expression control unit and a polynucleotide encoding 
Dampin polypeptide. Preferred cells include neurons, ovarian granulosa cells, uterine cells, 
hepatocytes, and myocytes. 

A method of increasing neuronal PKA activity comprising the step of contacting a 

25 substance capable of decreasing the ratio of Dampin to DARPP-32 with a neuron. Preferably, the 
substance is capable of passing through the blood brain barrier. Preferably, this method is used to 
increase PKA activity in dopaminergic neurons affected by neurological disorders. 

Preferred neurological disorders include: Parkinson's disease, Huntington's disease, ADD, 
Tourette's syndrome, and schizophrenia. Preferably, this method is used to increase PKA activity 

30 in hypothalamic neurons that express both dopaminergic and progesterone receptors. Preferably, 
increasing PKA activity in the hypothalamus is directed toward increasing sexual receptivity in a 
female individual. 

A method of increasing Atrial Natriuretic Factor (ANF) activity comprising the step of 
contacting a substance capable of decreasing the ratio of Dampin to DARPP-32 with a nephronic 
35 kidney cell. Preferably, this method is used to reduce blood volume. Further preferably, this 
method is used to reduce hypertension. 

An embodiment of the invention provides for a method of screening test substances for 
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modulators of Dampin expression. This method comprises the steps of: i) contacting a cell with a 
test substance; and ii) comparing Dampin expression in the cell after exposure to the test substance 
to that of an unexposed control cell. Dampin expression is determined by methods common to the 
art or included herein, by detecting Dampin polynucleotides or polypeptides. An example of this 
5 method comprises the steps of: i) culturing two equivalent cell samples; ii) adding a test substance 
to one of the cultures and not the other; iii) harvesting both cultures at a specified time; 
iv) purifying the mRNA from each sample of cells; v) comparing the level of Dampin mRNA in 
each sample by Northern blot, RTPCR, or another method common to the art. The invention 
provides for design and use of specific polynucleotide probes and primers, as discussed herein. An 

10 additional example comprises the steps of: i) having two equivalent cultures of cells; ii) adding a 
test substance to one of the cultures and not the other; iii) harvesting both cultures; iv) purifying 
the protein from each sample of cells; v) comparing the level of Dampin polypeptides in each 
sample by Western blot, immunohistochemistry, or another method common to the art. The 
invention provides for design and use of specific antibodies and antibody fragments, as discussed 

15 herein. 

A preferred embodiment of the invention provides a method of screening for test 
substances that modify the ratio of DARPP-32 relative to Dampin. This method comprises the 
steps of: i) contacting a cell with a test substance; ii) comparing Dampin expression in the cell 
after exposure to the test substance to that of an unexposed control cell; iii) comparing DARPP-32 

20 expression in the cell after exposure to the test substance to that of an unexposed control cell; 
iv) quantifying said expression levels; and v) determining the level of DARPP-32 relative to 
Dampin in the exposed (i.e., test) and unexposed (i.e., control) cells. 

A further preferred embodiment of the invention provides a method of screening for test 
substances that modify the ratio of DARPP-32 relative to Dampin in a specific cell type while not 

25 in others. Included in this method are test substances that are conjugated to cell-type specific 
ligands or portions thereof. For example, a test substance may be conjugated to a hydrophilic 
neuropeptide (e.g., interferon alpha, endorphin, somatostatin) for targeting to the brain (U.S. Patent 
4902505, which disclosure is hereby incorporated by reference in its entirety). 

A preferred embodiment of the invention provides a method of screening for test 

30 substances that decrease the ratio of DARPP-32 relative to Dampin. An alternative preferred 
embodiment of the invention provides a method of screening for test substances that increase the 
ratio of DARPP-32 relative to Dampin. 

In a preferred embodiment of the invention, a substance that decreases the ratio of 
DARPP-32 relative to Dampin is made by the process comprising the steps of: i) contacting a cell 

35 with a test substance; ii) comparing Dampin expression in the cell after exposure to the test 
substance to that of an unexposed control cell; iii) comparing DARPP-32 expression in the cell 
after exposure to that of an unexposed control cell; iv) quantifying said expression levels; 
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v) determining the level of DARPP-32 relative to Dampin in the exposed and unexposed cells. 
This substance is used for purposes discussed herein. 

In a preferred embodiment of the invention, a substance that increases the ratio of DARPP- 
32 relative to Dampin is made by the process comprising the steps of: i) contacting a cell with a 
5 test substance; ii) comparing Dampin expression in the cell after exposure to the test substance to 
that of an unexposed control cell; iii) comparing DARPP-32 expression in the cell after exposure to 
that of an unexposed control cell; iv) quantifying said expression levels; v) determining the level of 
DARPP-32 relative to Dampin in the exposed and unexposed cells. The substance that increases 
the relative level of DARPP-32 is used for purposes discussed herein. 

1 0 Methods of detecting Dampin polynucleotides and polypeptides may be used to detect 

DARPP-32 polynucleotides and polypeptides and are addressed herein (e.g., mRNA detection 
methods, antibody-based detection methods). A simple method for differentiating between the 
Dampin and DARPP-32 splice variants is desirable. A preferred embodiment of the invention 
provides a method for differentiating Dampin polypeptides from DARPP-32 polypeptides. This 

1 5 method comprises the steps of: i) contacting a first antibody that binds specifically to DARPP-32 
and not Dampin with a protein sample; ii) contacting a second antibody that binds specifically to 
both DARPP-32 and Dampin with a protein sample; and iii) detecting protein-bound antibody. 
Preferably, the first and second antibodies are labeled with a different detectable conjugate. This 
allows the method to be carried out with a single protein sample. Preferably, the protein sample is 

20 a fixed, semi-permeablized cell sample. Preferably, the detection method follows 
immunohistochemical protocols, as discussed herein. 

Dampin inhibits PKA by a competitive binding mechanism. Therefore, the inhibitory 
effect of Dampin may be ablated by a substance that blocks the interaction of Dampin with PKA. 
A preferred embodiment of the invention provides a method of screening for test substances that 

25 specifically bind to Dampin and prevent binding to PKA. This method comprises the steps of: i) 
contacting a test substance with Dampin polypeptide in the presence of PKA, under conditions that 
allow binding of Dampin to PKA (e.g., an intact cell); and ii) detecting the amount of PKA bound 
to Dampin in the presence and absence of the test substance by methods common to the art (e.g., 
antibody-based methods such as coimmunopreciptation and Western blotting). Preferably, the test 

30 substance is able to inhibit Dampin interaction with PKA in a certain cell type and not in others. 
Included in this method are test substances that are conjugated to cell-type specific ligands or 
portions thereof. 

In a preferred embodiment of the invention, a substance that inhibits Dampin binding to 
PKA is made by the process comprising the steps of: i) contacting a test substance with Dampin 
35 polypeptide in the presence of PKA, under conditions that allow binding of Dampin to PKA (e.g., 
a biological solution, preferably an intact cell); and ii) detecting the amount of PKA bound to 
Dampin in the presence and absence of the test substance by methods common to the art (e.g., 
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antibody-based methods such as coimmunopreciptation and Western blotting). 

In a preferred embodiment of the invention, a substance capable of increasing the ratio of 
Darrrpin to DARPP-32 is used in a method to decrease PKA activity in a neuron. Preferred 
substances are additionally capable of passing through the blood brain barrier in vivo. This method 
5 comprises the step of contacting a neuron with said substance. Diminished activity can be 
measured by an altered modulation of calcium channel function in response to dopamine, in situ. 
This diminished activity may also to be measured as a loss of dopamine-mediated inhibition of the 
sodium-potassium ATPase (Na,K ATPase) in situ or an increased excitability of striatal and 
cortical neurons. This method may also be applied to: i) diminish release of dopamine in response 

10 to amphetamines, as determined in situ; ii) diminish release of GAB A (4-Aminobutyric acid) in 
response to amphetamines, as determined in situ; iii) increase levels of substance P in the striatum 
and cortex, as determined in situ; iv) increase levels of neurotensin in the striatum and cortex, as 
determined in situ; v) attenuate increase in locomotor activity of an individual in response to 
cocaine; vi) attenuate increase in the protein Fos in response to an amphetamine, as determined in 

15 situ; vii) attenuate increase in the protein Chronic Fos Related Antigen (FRA) in response to 
cocaine, as determined in situ; and viii) diminish inhibition of the activity of the brain sodium- 
potassium-ATPase in response to dopamine, as determined in situ; and ix) decrease addictive 
behavior in an individual at risk of or displaying such behavior, as determined by family history or 
clinical assessment. 

20 In a preferred embodiment of the invention, a substance capable of increasing the ratio of 

Dampin to DARPP-32 is used in a method to decrease PKA activity in a cell of the female 
reproductive tract. This method comprises the step of contacting a cell of the female reproductive 
tract with said substance. Preferred cells include ovarian granulosa cells and luteal cells of the 
uterine tract. Dampin inhibits progesterone-mediated PKA activity, which is required for 

25 ovulation and implantation of a fertilized egg in the uterine wall. This method is directed toward 
female birth control. 

In a preferred embodiment of the invention, a substance capable of increasing the ratio of 
Dampin to DARPP-32 in glycogen-storing cells is used to modulate blood glucose levels. This 
method comprises the step of contacting said substance with glycogen-storing cells. Preferred 

30 cells include hepatocytes and myocytes. DARPP-32 inhibits PP1, which is required for glycogen 
synthase activity in response to insulin. Dampin does not inhibit PP1 and therefore will allow 
glucose processing and blood glucose modulation. This method is particularly useful for 
modulating glucose levels in insulin-deficient and diabetic individuals. 

As Dampin acts as a dominant negative inhibitor of DARPP-32, Dampin polypeptides may 

35 be expressed in a cell to inhibit PKA activity. In a preferred embodiment of the invention, a 

Dampin polypeptide or polynucleotide encoding said polypeptide in used to inhibit PKA activity in 
a cell. This method comprises the step of: introducing a Dampin polypeptide or polynucleotide 
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construct comprising an expression control unit operably linked to a Dampin-encoding 
polynucleotide into a cell. Preferred cells include but are not limited to: neurons, ovarian 
granulosa cells, uterine cells, hepatocytes, and myocytes. Methods of delivering a polypeptide or 
polynucleotide construct to a specific cell type are discussed herein. For example, a 

5 polynucleotide construct may be introduced to cells in culture by transfection, electroporation, or 
viral transduction, as commonly practiced in the art. As a further example, a polynucleotide 
construct may be introduced to a hepatocyte by packaging said polynucleotide construct into a 
liposomal vector; targeting the liposomal vector to the liver by embedding a hepatocyte-specific 
ligand in the membrane (e.g., hepatocyte growth factor); and introducing the liposome in a 

10 physiologically acceptable manner to an individual (e.g., orally or by injection). Preferably, this 
embodiment is directed toward: decreasing addictive behavior, especially in the case of alcohol 
addiction; reducing cocaine- or amphetamine-dependent responses; reducing progesterone- 
dependent ovulation and egg implantation; or increasing glycogen synthesis to control blood 
glucose levels, as discussed herein. 

15 In a preferred embodiment of the invention, a substance capable of decreasing the ratio of 

Dampin to DARPP-32 is used in a method to increase PKA activity in a neuron. Preferred 
compounds are additionally capable of passing through the blood brain barrier in vivo. This 
method comprises the step of contacting a neuron with said substance. Deficient dopaminergic 
signaling (and thus PKA activity) has been implicated in several major neurological and 

20 psychiatric disorders, including Parkinson's disease, Tourette's syndrome, ADD, Huntington's 
disease, and schizophrenia. As DARPP-32 is a vital downstream component the dopaminergic 
pathway, this method is preferably directed toward treatment of these disorders. Increased activity 
can be measured by an altered modulation of calcium channel function in response to dopamine, in 
situ, dopamine-mediated inhibition of the sodium-potassium ATPase (Na,K ATPase) in situ, an 

25 increased excitability of striatal and cortical neurons, or dopamine-mediated inhibition of brain 
sodium-potassium-ATPase activity, as determined in situ. Furthermore, this method may be used 
to increase sexual receptivity in a female individual. Preferred neurons for use in this method 
include hypothalamic neurons that express both the dopamine receptor and the progesterone 
receptor. Preferred individuals include breeding animals. Further preferred individuals include 

30 humans. 

In a further preferred embodiment of the invention, a substance that blocks the inhibition 
of PKA by Dampin is used in a method to increase PKA activity in a neuron. Preferred 
compounds are additionally capable of passing through the blood brain barrier in vivo. This 
method comprises the step of contacting a neuron with said substance. This method is directed 
35 toward treatment of neurological and psychiatric disorders, including Parkinson's disease, 

Tourette's syndrome, ADD, Huntington's disease, and schizophrenia. Furthermore, this method 
may be used to increase sexual receptivity in a female individual. Preferred neurons for use in this 
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method include hypothalamic neurons that express both the dopamine receptor and the 
progesterone receptor. Preferred individuals include breeding animals. Further preferred 
individuals include humans. 

DARPP-32 is required for proper Atrial Natriuretic Factor (ANF) activity in the kidney. 
5 ANF modulates blood sodium levels and reduces blood volume by inhibiting the renal sodium- 
potassium-ATPase, the sole active sodium transporter in the renal basolateral epithelia throughout 
the nephron. In a preferred embodiment of the invention, a substance capable of decreasing the 
ratio of Dampin to DARPP-32 expression is used in a method to activate ANF in a cell. Preferred 
cells are nephronic kidney cells. This method is applied to inhibit the activity of the renal sodium- 
1 0 potassium-ATPase in response to ANF, as determined in situ, and increase ANF-mediated sodium 
excretion in vivo. Preferably, this method is directed toward decreasing blood volume and 
hypertension. 

In a further preferred embodiment of the invention, a substance that blocks the inhibition of PKA 

by Dampin is used in a method to activate ANF in a cell. Preferred cells are nephronic kidney 
15 cells. This method is applied to inhibit the activity of the renal sodium-potassium-ATPase in 

response to ANF, as determined in situ, and increase ANF-mediated sodium excretion in vivo. 

Preferably, this method is directed toward decreasing blood volume and hypertension. 

Protein of SEQ ID NO:92 (Internal designation Clone 651658J81-35-2-0-C8-F) 

The cDNA of clone 651658J81-35-2-0-C8-F (SEQ ID NO:91) encodes the protein of 
20 SEQ ID NO:92, comprising the amino acid sequence: 

MPSSVSWGILLLAGLCCLW 

YRQLAHQSNSTNIFFSPVSIATAFAM^ 

RTLNQPDSQLQLTTGNGIJFL^^ 

VEKGTQGKIVDLVKELDR 
25 MMKRLGMFOTQHCKKLSSW 
. DRR5ASLHLPKLSITGTYDLKS 

DEKGTEAAGAMFLEAIPMSffPEVKFNKPFW 

Accordingly, it will be appreciated that all characteristics and uses of polypeptides of SEQ ID 
NO:92 described throughout the present application also pertain to the polypeptides encoded by the 

30 nucleic acids included in clone 651658_181-35-2-0-C8-F. In addition, it will be appreciated that 
all characteristics and uses of the polynucleotides of SEQ ID NO:91 described throughout the 
present application also pertain to the nucleic acids included in clone 65 1658_181-35-2-0~C8-F. A 
preferred embodiment of the invention is directed toward the compositions of SEQ ID NO:91 and 
SEQ ID NO:92. Also preferred are polypeptide fragments having a biological activity as described 

35 herein and the polynucleotides encoding the fragments. 

The cDNA of SEQ ID NO:91 is a novel variant of the human alpha 1 anti-trypsin protein 
named VAGS, encoded by a gene located on chromosome 14, specifically at position 14q32.1. 



286 



WO 02/094864 



PCT/IB01/01715 



The cDNA of SEQ ID NO:91encodes a 41 8 amino-acid protein of SEQ ID NO:92. 

Proteases are key components of a broad range of biological pathways and can be 
classified into four groups according to their catalytic mechanisms: the serine, cysteine (thiol), 
aspartic (carboxyl), and metalloproteases. VAGS displays serpin motif and thus belongs to the 
5 serine protease inhibitor family of protein named serpin. Serpins are irreversible suicide inhibitors 
of proteases that have a central role in regulating proteolysis in diverse physiological processes 
such as blood coagulation, fibrinolysis, complement activation, angiogenesis, apoptosis, 
inflammation, neoplasia and viral pathogenesis. VAGS neutralizes any trypsin formed 
prematurely within the cells by binding to its active site forming stable complexes with its target 

10 enzymes, which is a general property of serpin/serine protease interactions. VAGS is synthesized 
in the liver and, in response to inflammatory stimuli, inhibits the proteolytic enzyme neutrophil 
elastase, released from activated neutrophils at sites of inflammation. In hepatocytes, VAGS 
expression is increased by the cytokine interleukin-6 (IL-6). Synthesis of VAGS is tightly 
regulated by the net balance of neutrophil elastase and VAGS at sites of inflammation/tissue 

15 injury. Alterations of a serpin which can affect its functional levels may result in pathology. 
Congenital serpin deficiencies cause specific clinical syndromes such as thrombosis with anti- 
thrombin HI deficiency. Individuals with VAGS deficiency are susceptible to premature 
development of emphysema and liver diseases. In addition, changes in the balance between serine 
proteases and their cognate inhibitors may lead to pathological states similar to those associated 

20 with some neurodegenerative diseases such as Alzheimer's disease. 

In one embodiment, VAGS, or fragment thereof, provide an in vitro assay to test the 
specific sensitivity of various proteases to VAGS. The protease inhibitor activity of VAGS may be 
assessed using any techniques known to those skilled in the art including those disclosed in the US 
Patent 5,955,284, which disclosure is hereby incorporated by reference in its entirety. Possible 

25 substrates for the protein of the invention include, but are not limited to, serine proteases such as 
elastase, trypsin, chymotrypsin, thrombin HI, plasmin, heparin, complement II, plasminogen 
activator, protein C, interleukin-lbeta converting enzyme, preferably trypsin, elastase and 
chymotrypsin. Methods to assess the activity of such proteases inhibitors include the steps of 
contacting the inhibitor to be tested with one or several protease substrat in a competition system, 

30 and detecting the amount of inhibition of the present protein that occurs. Competitive system can 
also be used to determine the respective affinities of VAGS among all protease substrates. 

In another embodiment, VAGS, or fragment thereof, may be used to remove, identify or 
inhibit contaminating proteases in a sample. Compositions comprising the polypeptides of the 
present invention may be added to biological samples as a "cocktail" with other protease inhibitors 

35 to prevent degradation of protein samples. The advantage of using a cocktail of protease inhibitors 
is that one is able to inhibit a wide range of proteases without knowing the specificity of any of the 
proteases. Using a cocktail of protease inhibitors also protects a protein sample from a wide range 
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of future unknown proteases which may contaminate a protein sample from a vast number of 
sources. For example, the protein of the invention or fragment thereof are added to samples where 
proteolytic degradation by contaminating proteases is undesirable. Such protease inhibitor 
cocktails are widely used in assays to inhibit proteases susceptible of degrading a protein of 
5 interest for which the assay is to be performed. Alternatively, the protein of the invention or 
fragment thereof may be bound to a chromatographic support, either alone or in combination with 
other protease inhibitor, using techniques well known in the art, to form an affinity 
chromatography column. A sample containing the undesirable protease is run through the column 
to remove the protease. Alternatively, the same methods may be used to identify new target 

10 proteases of the protein of the invention. 

In one embodiment, VAGS, or fragment thereof, may be useful to quantify the amount of a 
given protease in a biological sample, and thus used in assays and diagnostic kits for the 
quantification of proteases in bodily fluids or other tissue samples, in addition to bacterial, fungal, 
plant, yeast, viral or mammalian cell cultures. In a preferred embodiment, the sample is assayed 

15 using a standard protease substrate. A known concentration of protease inhibitor is added, and 
allowed to bind to a particular protease present. The protease assay is then rerun, and the loss of 
activity is correlated to the protease inhibitor activity using techniques well known to those skilled 
in the art. Preferred proteases in this embodiment are seine protease, more preferably elastase, 
trypsin and chymotrypsin. 

20 In another preferred embodiment, VAGS, or fragment thereof may be used as anti- 

microbial agent useful to inhibit exogenous proteases implicated in a number of infectious diseases 
including, but not limited to, bacterial and parasite-borne infections. For example, protease 
inhibitors are able to inhibit growth of all strains of group A streptococci, including antibiotic- 
resistant strains. Accordingly, the present invention may be used to retard or inhibit the growth of 

. 25 certain microbes either in vitro or in vivo. 

The present invention provides a method for identify other molecules which specifically 
binds VAGS. For example, the composition of the balance proteases/proteases inhibitors of a 
diseased tissue can be determined by isolating the present protein under conditions that do not 
disrupt protein-protein interactions, and determining the identity of proteins associated with the 

30 present protein. Such associated proteins can be identified by any standard method including, but 
not limited to, immunoprecipitation and immuno-affmity columns. It can also comprise an 
investigation using the yeast-2-hybrid trap for identification of new interactions involving relevant 
targets of the present protein that could be implicated in some diseases affecting serpin biology. 
Another method can comprise the combination of the present protein with the library of molecules 

35 under conditions suitable to allow complex formation, and detecting complex formation, wherein 
the presence of the complex identifies a molecule which specifically binds the protein of the 
invention and that could be accumulated in some disorders. 
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In a further embodiment, VAGS provides a method of producing a recombinant serpin 
capable of effectively modulating serine protease activity. Despite the availability of human alpha 
1 anti-trypsin from serum, quantities large enough for therapeutic uses have been unobtainable, due 
in large part to the limited availability of human serum. Consequently, there is a great need for 
5 other sources of alpha 1 anti-trypsin to fill the needs created by therapeutic uses. In one preferred 
embodiment, milkers animal can be used to produce the protein of the invention in the milk, 
thereby generating a significant amount of this particular protein after purification. Any type of 
animal that produce enough quantity of milk can be used in this aim such as, but not limited to, 
sheep, goat, and cow. These animals can be generated with any method of targeting 

10 overexpression of the present protein in the milk. Also in this embodiment, the protein of the 
invention can be produced in host cells that have been transfected with an appropriate expression 
vector comprising a nucleic acid sequence coding for the present protein. The host cells are 
cultured under conditions whereby the nucleic acid sequence coding for this particular protein is 
expressed. After a suitable amount of time for the product to accumulate, the protein is purified 

15 from the host cells or medium surrounding the cells. Introduction of an expression vector 

incorporating a nucleic acid sequence coding for the protein of the invention into a host cell can be 
performed in a variety of ways, such as but not limited to calcium or lithium chloride treatment, 
electroporation, lipofection. 

In another embodiment, use of VAGS provides a method of effectively modulating serine 

20 proteases activity in cells. For example, the level or activity of the present protein can be increased 
in cells to decrease the rate or inhibit specific serine proteases by contacting the biological sample 
with an amount of the present protein sufficient to decrease the rate or inhibit specific serine 
proteases of one or more cells within the sample, or with a compound that increases the activity or 
expression of the present protein within one or more cells of the sample. Such methods can be 

25 performed either in vitro or in vivo. The level of the present protein can be increased in cells in 
any of a number of ways, including by administering purified protein to the cells, transfecting the 
cells with a polynucleotide encoding the protein, or administering a compound to the cells that 
causes an increase in the activity or expression of the protein. Alternatively, serine proteases level 
can be increase by decreasing the level of the present protein in cells, for example using antisense 

30 molecules, or more specifically inhibit the activity of the present protein using direct or indirect 
inhibitor molecules or antagonistic antibodies directed against the present protein. 

The present invention also provides animal models generated by modulating the expression 
or activity of the present protein in one or more tissues of the animal. Such animals are useful for a 
number of purposes, for example to assist with the study of the human alpha 1 anti-trypsin 

35 deficiency disease, because they represent an in vivo assay method for testing candidate molecules 
potentially useful for the treatment of various pathophysiological aspects of diseases specifically 
related to the activity of the present protein. Study of the phenotype of such models can also allow 
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the identification of additional human equivalent diseases caused by or linked with alpha 1 anti- 
trypsin deficiency. These animals can be generated with any method of targeting overexpression 
or inactivation of the present protein. Such models are extremely useful, e.g. in the assessment of 
candidate therapies and drugs for the treatment of inflammatory diseases and conditions. 

5 In other embodiment, VAGS, or fragment thereof, is used to diagnose diseases or disorders 

associated with altered expression or activity of the present protein. In particular, it is useful in 
diagnosing patients with deficient amounts of the present invention which results in uncontrolled 
activity of target proteases. Examples of such diseases and disorders include, but are not limited 
to, alpha 1 anti-trypsin deficiency associated disorders and more specifically liver diseases, or 

10 diseases associated with an excess level of elastase, such as rheumatoid arthritis, emphysema, and 
psoriasis. The method includes the steps of contacting a fluid or tissue sample obtained from an 
individual suspected of suffering from the disease or condition, or at risk of developing the disease 
or condition, with a compound capable of selectively binding the present protein or nucleic acids, 
e.g. a polyclonal or monoclonal antibody or any immunologically active fragment thereof, a 

15 nucleic acid probe, etc., and detecting the level, or any other detectable property of the present 
protein in the sample, where a difference in the level or other property in the sample relative to in a 
control sample indicates the presence of the disease or disorder, or of a propensity for developing 
the disease or disorder. In this embodiment, the identification of mutations using well known PCR 
or RT-PCR techniques and in particular in with real time PCR system that could facilitate 

20 diagnosis of such conditions. Alternatively, using such a method, the present invention provides a 
tool to correlate modulations in the expression of the specific variant of the invention with some 
pathologies which have never been linked to. Thus, the present invention provides a novel 
candidate gene for such conditions. 

A further embodiment of the present invention is to provide novel methods and 

25 compositions useful for the treatment of diseases and conditions related to the abnormal function • 
of proteases or their inhibitors. The VAGS, or fragment thereof, may be used to inhibit proteases 
implicated in a number of diseases where cellular proteolysis occur such as diseases characterized 
by tissue degradation preferably including, but not limited to, arthritis, muscular dystrophy, 
inflammation, tumor invasion, glomerulonephritis, parasite-borne infections, Alzheimer's disease, 

30 periodontal disease, and cancer metastasis. The methods and compositions can also be useful for 
treatment of septic shock, pancreatitis, coagulation disorders. In a more preferably embodiment, 
the invention relates to compositions and methods to use the protein of the invention or fragment 
thereof in diseases characterized by an abnormally elevated levels of trypsin, chymotrypsin, or 
elastase, including but not limited to, chronic emphysema of the lungs, cirrhosis, liver diseases, 

35 cystic fibrosis, and more specifically for alpha 1 anti-trypsin deficiency associated disorders such 
as aneurysm or toxic shock. In this embodiment, the present invention is preferably applied in the 
treatment of diseases associated with an excess level of elastase, such as rheumatoid arthritis, 
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emphysema, and psoriasis. Indeed, uncontrolled secretion of elastase which frequently results 
from aging of the cells or genetic defects may cause non-specific proteolysis and trigger 
destructive processes associated with those various chronic diseases. Such methods comprise the 
administration of a therapeutically-effective amount of the present protein to mammals suffering 
5 from the disease or condition, where "effective amount" is meant a concentration of the present 
protein which is capable of modulating the activity of serine proteases. The compositions of the 
invention are preferably delivered to the affected mammals in combination with a physiologically 
acceptable liquid, such as a saline solution or other buffer, or physiologically acceptable carrier. 
For treatment of skin inflammation, the compositions of the invention may be applied to the 

1 0 affected area in combination with a physiologically acceptable ointment or cream. The 
proportional ratio of active ingredient to pharmaceutical carrier will naturally depend on the 
chemical nature, solubility, and stability of the recombinant serine protease inhibitor. The 
particular amount of the compositions of the invention that will be administered to the mammal for 
any particular condition will depend on the clinical condition of the patient and the type of illness, 

15 and other factors such as the weight, age, the patient and route of delivery. Such composition can 
be administered by any suitable route including, but not limited to, intravenous, intramuscular, 
intraperitoneal, subcutaneous routes, and topically to an affected area of the skin or by absorption 
through epithelial or mucocutaneous linings such as nasal, oral, vaginal, rectal. Alternatively, for 
treatment purposes, the protein of the invention may be administrated using any of the gene 

20 therapy methods known in the art. These compositions can comprise the protein of the invention, 
and, optionally, one or more other types of protease inhibitors, or any other compound of interest. 
Indeed, in this embodiment, the present invention find use in drug potentiation applications. For 
example, therapeutic agents such as antibiotics or antitumor drugs can be inactivated through 
proteolysis by endogenous proteases, thus rendering the administrated drug less effective or 

25. inactive. Accordingly, the protease inhibitor of the invention may be administrated to a patient in 
conjunction with a therapeutic agent in order to potentiate or increase the activity of the drug. This 
co-administration may be by simultaneous administration, such as a mixture of the protease 
inhibitor and the drug, of by separate or sequential administrations. All of these components may 
be either obtained from natural sources or produced by recombinant genetic engineering 

30 techniques and/or chemical modification. 

Since the regulation of serine proteases by their inhibitors are critical for the control of tissue 
destruction in the diseases described above, in a further embodiment, VAGS, or fragment thereof 
provides an assay for the monitoring of markers in vivo for characterisation of disease states. The 
invention thus includes test kits useful for the quantification in a biological sample of the amount 

35 of the present protein. The kits comprise at least one immunological binding partner, e.g. a 
monoclonal or polyclonal antibody specific for the protein of the invention and coupled to 
detectable markers. In this embodiment, the application of such assays can be used to monitor the 
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progress of therapy administered to treat these or other conditions. Further, the assays can be used 
as a measure of toxicity, or during clinical testing of new drugs to assess the impact on tissue 
degradation. Thus the assays may be applied in any situation wherein the present invention can be 
used as an index of the condition, treatment, or effect of substances directly administered to the 
5 subject or to which the subject is exposed in the environment This marker may thus also play a 
role as prognostic indicators, preferably concerning inflammatory diseases. For example, it can be 
used in the Alzheimer's disease where chronic inflammation is an accompanying physiological 
contributor to this multifactor pathology. Also in a preferred embodiment, the present invention 
provides a method of detecting the presence and/or monitoring the metastatic progress of a 

10 malignancy. Indeed, metastatic potential can be influenced by the interaction between the 

neoplastic cells and their microenvironment such as extracellular matrix and proteolytic enzymes 
including the present protein. The invention thus includes test kits useful for quantify the amount 
of the present protein in a biological sample comprising the steps of contacting the biological 
sample with a specific monoclonal or polyclonal antibody specific for the present protein and 

15 coupled to detectable markers. Thus, the condition of a patient can be monitored continuously and 
the quantified amount of such proteins measured in the pathological sample can be compared with 
the amount quantified in a biological sample of a normal individual or with the previous analysis 
of the same patient. In all this embodiment, this marker can be measured effectively in plasma, 
serum or blood, by any suitable method, including immunoassays. It can also preferably be 

20 mesured in tissues and fluids recovered from inflammatory sites. Thus, the condition of a subject 
can be monitored continuously and the quantified amount of this particular protein measured in the 
pathological sample can be compared with the amount quantified in a biological sample of a 
normal individual. 

Polynucleotides of SEQ ID NO:93 (Internal designation Clone 150011_110-006-3-0-D5-F) 
25 and SEQ ID NO:95 (Internal designation Clone 500737461_205-43-3-0-E3-F) 

The cDNA of clone 15001 1_1 10-006-3-0-D5-F (SEQ ID:93) encodes an allele of Tissue 
Factor Pathway Inhibitor-1 (TFPI-1), comprising the nucleotide sequence: 
CTCnTTGCTCTAA 

GCTCTTTCACTGCTAGTAAGATCAGATTC 
30 TTTCTTGATCTGCTTCTAAAAG 

GGAAGGAAAAACAAAATAACCTCAACTCCGTITrG 

CATCAGAGATTTTACTTAGATGATTTACA 

TCTGTCCCTGCTGCTTAATCTTGCCCCTGCCCCT 
. AAGAACACACAATTATCACAGATACGGAGTTGC^ 
35 TTGTGCATTCAAGGCGGATGATAGCCCA^ 

AATATTTTCACTCGACAGTGCGAAGA^ 

ATCGAmGAAAGTCTGGAAGAGTGCAAAAAAAT 
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GGATTATAAAGACAACATTGCAACAAGAAAAGCCAGATTTCTGCTTTTTGGAAGAAG 
ATCCTGGAATATGTCGAGGTTATA1TACCAGGTATTTTTATAACAATCAGACAAAACA 
TGTGAACGTTTCAAGTATGGTGGATGCCTGGGCAATATGAACAATTTTGAGACACTGG 
AAGAATGCAAGAACATTTGTGAAGATGGTCCGAATGGTTTCCAGGTGGATAATTATGG 

5 AACCCAGCTCAATGCTGTGAATAACTCCCTGACTCCGCAATCAACCAAGGTTCCCAGC 
CTTTTTGTTACAAAAGAAGGAACAAATGATGGTTGGAAGAATGCGGCTCATATTTACC 
AAGTCTTTYTGAACGCCTTCTGCATTCATGCATCCATGTTCTTTCTAGGATTGGATAGC 
ATTTCATGCCTATGTTAATATTrGTGCTTTTGGCATTTCCTTAAT 
TGATGCCTTTGATAGCATACTGCTAATAAAGTTTTAATATTTACATGCATAGGAA 

10 AAAAAAAAAA. Accordingly, it will be appreciated that all characteristics and uses of the 
polypeptides of SEQ ID NO:94 and polynucleotides of SEQ ID NO:93 described throughout the 
present application also pertain to the nucleic acids included in Clone 15001 1_1 10-006-3-0-D5-F. 
Clone 15001 1_1 10-006-3-0-D5-F is alternatively referred to herein as TFPI-C16Pfs in reference to 
the nucleotide polymorphism that is a subject of the present invention. A preferred embodiment of 

15 the invention is directed toward the compositions of SEQ ID NO:93, SEQ ID NO:94, and Clone 
15001 1_1 10-006-3-0-D5-F. Also preferred are polypeptide fragments having a biological activity 
as described herein and the polynucleotides encoding the fragments. 

The cDNA of clone 500737461_205-43-3-0-E3-F (SEQ ID:95) encodes an allele of Tissue 
Factor Pathway frihibitor-1 (TFPI-1), comprising the nucleotide sequence: 

20 CTCTTTGCTCTAACAGACAGCAGCGACTTTAGGCT 
GCTCTrrCACTGCTAGTAAGATCAGA 
TTTCTTGATCTGCTTCTAAAAGAAGAAGT 
GGAAGGAAAAACAGAATAACCTCAACTCCGTITTGAAAAA 
. CATCAGAGATTTTACTTAGATGATTTACACAATGAAGAAAGTACATGCACTTTGGGCT 

25 TCTGTATGCCTGCTGCITAATCTTGCCCCTGCCCCTCTrAATGCTG 

TGAAGAACACACAATTATCACAGATACGGAGTTGCCACCACrGAAACTTATGCATTCA 
TTTTGTGCATTCAAGGCGGATGATGGCCCATGTAAAGCAATCATGAAAAGATTTTTCT 
TCAATAT1TTCACTCGACAGTGCGAAGAATTTATATATGGGGGATGTGAAGGAAATCA 
GAATCGATTTGAAAGTCTGGAAGAGTGCAAAAAAATGTGTACAAGAGATAATGCAAA 

30 CAGGATTATAAAGACAACATTGCAACAAGAAAAGCCAGATTTCTC 

AGATCCTGGAATATGTCGAGGTTATATTACCAGGTA1TTTTATAACAATCAGACAAAA 
. , CAGTGTGAACGTTTCAAGTATGGTGGATGCCTGK3GCAATCAACAATTTTGAGACACTG 
GAACAATGCAAGAACATITGTGAAGATGGTCCGAATGGTTTCCAGGTGGATAATTATG 
GAACCCAGCTCAATGCTGTGAATAACTCCCTGACTCCGCAATCAACCAAGGTTCCCAG 

35 CCTrTTTGAATTTCACGGTCCCTCATGGTGTCTCACTCCAGCAGACAGAGGATTGTGTC 
GTGCCAATGAGAACAGATTCTACTACAATTCAGTCATTGGGAAATGCCGCCCATTTAA 
GTACAGTGGATGTGGGGGAAATGAAAACAATTTTACTTCCAAACAAGAATGTCTGAG 
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GGCATGTAAAAAAGGTTTCATC 

AAGAAAAAGAAAGAAGCAGAGAGTGAAAATAGCATATGAAGAAATTm 
ATATGTGAATTTGTTATAGCAATGTAACATTAATTCTACTAAA 
TTTCACTATGATTTTCTA'll l'l'lCTT 
5 TTCTATGCTTATTGCAAAAAAAAAAAAAAAA. Accordingly, it will be appreciated that all 
characteristics and uses of the polynucleotides of SEQ ID NO:95 and polypeptides of SEQ 
IDNO:96 described throughout the present application also pertain to the nucleic acids included in 
Clone 500737461J2O5-43-3-0-E3-F. Clone 500737461_205-43-3-0-E3-F is alternatively referred 
to herein as TFPI-M162Qfs in reference to the nucleotide polymorphism that is a subject of the 

10 present invention. A preferred embodiment of the invention is directed toward the compositions of 
SEQ ID NO:95, SEQ ID NO:96, and Clone 500737461_205-43-3-0-E3-F. Also preferred are 
polypeptide fragments having a biological activity as described herein and the polynucleotides 
encoding the fragments. 

The extrinsic coagulation pathway is initiated on exposure of Tissue factor (TF) to plasma 

15 (McVey JH, Bailliere's Clinical Haematology 12:361-72 (1999) which disclosure is hereby 

incorporated by reference in its entirety). Tissue Factor Pathway Inhibitor-1 (TFPI-1) is a negative 
regulator of the extrinsic coagulation pathway (US Patent 5,849,875 which disclosure is hereby 
incorporated by reference in its entirety). 

TFPI-1 is a secreted trivalent Kunitz-type plasma proteinase inhibitor. TFPI-1 negatively 

20 regulates the initiation of coagulation through a mechanism of activated factor X (FXa) feedback 
inhibition of the catalytic complex of activated factor VII (FVHa) and TF. That is, TFPI-1 directly 
inhibits FXa and, in a FXa-dependent fashion, produces feedback inhibition of the TF-FVIIa 
complex by allosteric enablement of TF-FVIIa binding. TFPI-1 is the major inhibitor of the 
protease activity of the TF-FVIIa complex. The second Kunitz domain of TFPI-1 binds and 

25 inhibits FXa, whereas the first Kunitz domain is responsible for the inhibition of FVEIa in the TF- 
FVIIa complex. The function of the third Kunitz domain is unknown, although there is evidence 
that it contains a heparin binding site. Heparin binding site(s) have also been mapped carboxyl- 
terminal to the third Kunitz domain. 

Tissue factor pathway of coagulation plays a dominant role during normal haemostasis. 

30 TFPI-1, expressed primarily by the microvascular endothelium, appears to be the major 

physiologic inhibitor of TF-induced coagulation. TF-initiated coagulation also plays an important 
role in the pathophysiology of many diseases, including coronary thrombosis, disseminated 
intravascular coagulation, stroke, and atheriosclerosis. Several animal studies have found a 
beneficial effect of recombinant TFPI-1 in some of these clinical conditions. 

35 TFPI-1 plays an important role in modulating TF-dependent thrombogenesis. 

Recombinant full-length TFPI-1 prevents thrombosis formation and rethrombosis after lysis in a 
rabbit model of jugular vein thrombosis (Kaiser, B et al. Thromb. Haemost 76:615-20 (1996) 
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which disclosure is hereby incorporated by reference in its entirety). In a rat model of 
disseminated intravascular coagulation, TFPI-1 was found to inhibit thrombus formation (Elsayed, 
YA et al., Am. J. Clin. Pathol. 106:574-83 (1996) which disclosure is hereby incorporated by 
reference in its entirety). 

5 High levels of TF antigen and activity are detected in atheriosclerotic lesions, particularly 

in the advanced lesions. When the plaques are ruptured or eroded, exposure of cellular and 
extracellular TF to circulating blood plays a pivotal role in mediating fibrin-rich thrombus 
formation leading to acute coronary syndromes. Presence of TFPI-1 in atheriosclerotic plaques is 
associated with reduced tissue factor activity and reduced plaque thrombogenicity (Caplice, NM et 

10 al., Circulation 98:105 1-7 (1998); Badimon, JJ et al., Circulation 99: 1780-7 (1999) which 
disclosures are hereby incorporated by reference in their entirety). 

An recent study in mice using the gene knockout technology unambiguously established 
that deficiency of TFPI-1 promotes atheriosclerosis and thrombosis. In this work, it was found that 
TFPI-1 protects from atheriosclerosis and is an important regulator of the thrombosis that occurs in 

15 the setting of atheriosclerosis (Westrick, RJ et al, Circulation, 103:3044-6 (2001) which disclosure 
is hereby incorporated by reference in its entirety). Importantly, it was found that in this model 
inactivation of only one of the two copies of the TFPI-1 gene was sufficient to promote 
atheriosclerosis and thrombosis. 

Recently several amino acid polymorphisms have been identified for human TFPI-1 . A 

20 mutation at nucleotide position 1 of exon 7 results in the substituion of leucine for proline at 
position 179 (numbered from the initiating methionine of TFPI-1) (Kleesiek, K et al., Blood 
10:3976-7 (1998) which disclosure is hereby incorporated by reference in its entirety). This 
mutation occurs immediately downstream of Kunitz domain 2 (US Patent 5,849,875 which 
disclosure is hereby incorporated by reference in its entirety). This mutation has been found to be . 

25 statistically associated with a higher risk for venous thrombosis (Kleesiek, K et al., Thromb. 
Haemost. 82:1-5 (1999) which disclosure is hereby incorporated by reference in its entirety). 

A second amino acid polymorphism results in the substitution of methionine for valine at 
position 292 (numbered from the initiating methionine of TFPI-1). This mutation occurs very near 
the carboxy-terminus of TFPI-1. As might be expected for a mutation so far downstream of Kunitz 

30 domains 1 and 2, no link was found between this mutation and venous thromboembolic disease 
(Amaud, E et ah, Thromb. Haemost. 82:159-60 (1999) which disclosure is hereby incorporated by 
reference in its entirety). 

The cDNA of clone 15001 1 encodes the protein of SEQ ID NO:94. In the case of TFPI- 
C16Pfs, a deletion of two nucleotides in codon 16 (numbered from the initiating methionine of 

35 TFPM) results in the substitution of proline for cysteine and in the introduction of a frame-shift 
leading to premature termination of the protein within the signal sequence (exon 3). Specifically, 
whereas codon 16 of TFPI-l reads TGC (US Patent 5,849,875 which disclosure is hereby 
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incorporated by reference in its entirety), in TFPI-C16Pfs nucleotides T and G have been deleted. 
As protein TFPI-C16Pfs terminates upstream of Kunitz domains 1 and 2, the protein of SEQ ID 
NO:94 is nonfunctional. 

The cDNA of clone 500737461 encodes the protein of SEQ ID NO:96. Li the case of 

5 TFPI-M162Qfs, a deletion of two nucleotides in codon 162 (numbered from the initiating 
methionine of TFPI-1) and mutation of the remaining nucleotide results in the substitution of 
glutamine for methionine and in the introduction of a frame-shift leading to premature termination 
of the protein within Kunitz domain 2 (exon 6). Specifically, whereas codon 162 of TFPI-1 reads 
ATG (US Patent 5,849,875 which disclosure is hereby incorporated by reference in its entirety), in 

10 TFPI-M162Qfs two of the nucleotides have been deleted and the third changed to C. As protein 
TFPI-M162Qfs terminates within Kunitz domain 2, neither FXa binding nor the consequential 
enablement of TF-FVIIa-binding by Kunitz domain 1 occurs, leading to nonfunctional protein of 
SEQIDNO:96. 

The availability of informative genetic screenings and diagnostic markers for genetic 

15 - predisposition to thrombosis would be of considerable value. On one hand, said information can 
be used by the patient to make appropriate lifestyle changes. On the other hand, said information 
can be used by the physician to anticipate thrombotic complications that might arise in the course 
of clinical procedures. In both cases, said information results in health benefit to the patient and in 
reduced medical costs borne by the patient as well as by society in general. 

20 The nucleotide polymorphisms that are described herein for clones 1 500 1 1 (TFPI-C 1 6Pfs) 

and 500737461 (TFPI-M162Qfs) and that are the subject of the present invention lead to 
nonfunctional TFTI-1. There is clear evidence that having even just one of the two copies of the 
TFPI-1 gene inactivated predisposes the patient to atheriosclerosis and thrombosis. It follows 
therefore that the nucleotide polymorphisms described here within the coding region of TFPI-1 that 

25 lead to nonfunctional TFPI-1 have genetic screening and diagnostic value in identifying patients 
that are genetically predisposed to atheriosclerosis and thrombosis. 

In a preferred embodiment, the present invention provides for a method of diagnosing 
genetic predisposition to atheriosclerosis and thrombosis through the identification of a 
dinucleotide deletion in TFPI-1 codon 16 (numbered from the initiating methionine of TFPI-1). 

30 Methods of identifying such a dinucleotide deletion are well known to those skilled in the art and 
include, but are not restricted to, to PCR-SSCP (polymerase chain reaction followed by single- 
strand conformation polymorphism) (Kleesiek, K et al., Blood 10:3976-7 (1998) which disclosure 
is hereby incorporated by reference in its entirety). 

In a further preferred embodiment, the present invention is drawn to a method of 

35 determining if an individual is at increased risk of developing atheriosclerosis and thrombosis 
comprising the step of identifying a dinucleotide deletion in TFPI-1 codon 16 (numbered from the 
initiating methionine of TFPM), preferably using the method of PCR-SSCP, in a biological 
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sample, preferably blood, wherein said deletion indicates increased risk. 

In additional preferred embodiment, the present invention provides for a method of 
diagnosing genetic predisposition to atheriosclerosis and thrombosis through the identification of a 
dinucleotide deletion in TFPI-1 codon 162 (numbered from the initiating methionine of TFPI-1). 
5 Methods of identifying such a dinucleotide deletion are well known to those skilled in the art and 
include, but are not restricted to, to PCR-SSCP (polymerase chain reaction followed by single- 
strand conformation polymorphism) (Kleesiek, K et al., Blood 10:3976-7 (1998) which disclosure 
is hereby incorporated by reference in its entirety). 

In a further preferred embodiment, the present invention is drawn to a method of deteraiining if an 
10 individual is at increased risk of developing atheriosclerosis and thrombosis comprising the step of 
identifying a dinucleotide deletion in TFPI-1 codon 162 (numbered from the initiating methionine 
of TFPI-1), preferably using the method of PCR-SSCP, in a biological sample, preferably blood, 
wherein said deletion indicates increased risk. 

Protein of SEQ ID NO:100 (Internal designation Clone 479155_174-4-4-0-C8-F) 
15 The cDNA of clone 479155_174^-4-0-C8-F (SEQ ID NO:99) encodes the protein of SEQ 

ID NO:100 comprising the amino acid sequence 

MTVKGVASRTWSRPFPGNWLFS 

GTDWVKELIXTIPKYK^^ 

VEWEDRDDVLRNAMVNRKTGKFSMEVKKTVD 
20 Accordingly it will be appreciated that all characteristics and uses of polypeptides of SEQ ID 

NO: 100 described throughout the present application also pertain to the polypeptides encoded by 

the nucleic acids included in Clone 479155_174-4-4-0-C8-F. In addition, it will be appreciated 

that all characteristics and uses of the polynucleotides of SEQ ID NO:99 described throughout the 

present application also pertain to the nucleic acids included in Clone 479155_174-4-4-0-C8-F. A 
25 preferred embodiment of the invention is directed toward the compositions of SEQ ID NO:99, 

SEQ ID NO:100, and Clone 479155J74-4-4-0-C8-F. Also preferred are polypeptide fragments 
r having a biological activity as described herein and the polynucleotides encoding the fragments. 
The protein of SEQ ID NO: 100 encodes ADEVAR, a variant of calcium channel 

alpha2delta3 subunit resulting from alternative splicing. ADEVAR has novel function as 
30 described below. 

Alpha2delta3 subunit is a component of voltage-gated Ca2+ channels. The alpha2 subunit 
has several hydrophobic sequences, but biosynthetic studies indicate that it is an extracellular, 
extrinsic membrane protein attached to the membrane through disulfide linkage to the delta 
subunit. The delta subunit is encoded by the 3' end of the coding sequence of the same gene as the 
35 alpha2 subunit, and the mature forms of these two subunits are produced by post-translational 
proteolytic processing and disulfide linkage (Catterall, WA, Annu. Rev. Cell Dev. Biol. 16:521-55 
(2000) which disclosure is hereby incorporated by reference in its entirety). Alpha2delta3 subunit 
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is expressed exclusively in the brain (Klugbauer, N et al., J. Neuroscience 19:684-691 (1999) 
which disclosure is hereby incorporated by reference in its entirety). Alpha2delta3 subunit plays a 
role in regulating neuronal Ca2+ currents (Catteral, WA, Annu. Rev. Cell Dev. Biol., 16:521-55 
(2000); Stefani, A et al., Neuropharmacology 37:83-91 (1998) which disclosures are hereby 
5 incorporated by reference in their entirety). Alpha2delata3 subunit has been implicated in epileptic 
seizures (Gee NS et al., J. Biol Chem. 271:5768-76 (1996); Bryans JS et al., J. Med. Chem. 
41:1838-1845 (1998) which disclosures are hereby incorporated by reference in their entirely). 

ADEVAR is a product of alternative splicing leading to a soluble protein truncated at both 
it amino- and carboxyl-termini. ADEVAR plays a negative regulatory role in Ca2+ channel 

10 function. Diminished ADEVAR expression leads to dysregulated Ca2+ flux through the channel 
and reduced neuronal excitability. 

In a preferred embodiment, the present invention provides for an antibody that specifically 
binds ADEVAR of the present invention. Further preferred is a method of making said antibody 
wherein said antibody recognizes a non-conformational or conformational epitope of ADEVAR. 

15 Further preferred is a method wherein a mouse is immunized with ADEVAR. Further 

preferred is said immunization with ADEVAR, wherein ADEVAR is produced by recombinant 
DNA methodology. Further preferred is a method wherein monoclonal antibodies from said 
mouse are screened for binding to ADEVAR but not to full-length alpha2delta3 subunit. Further 
preferred is a method wherein monoclonal antibodies derived from said mouse are screened by 

20 enzyme-linked immunosorbent assay (ELISA) for binding to ADEVAR but not to full-length 
alpha2delta3 subunit. Methods of expressing protein by recombinant DNA methodology are well 
known to those skilled in the art. Methods of generating said monoclonal antibody and of 
establishing specificity by methods including ELISA are well known to those skilled in the art 
In a further preferred embodiment, the present invention provides for a method wherein 

25 said ADEVAR antibody is used in a method of quantitating ADEVAR in bodily fluid. Further 
preferred is a method of quantitating ADEVAR in bodily fluid, wherein the method of quantitation 
is a sandwich ELISA format. Further preferred is a method wherein said ADEVAR antibody is 
used to measure ADEVAR concentration in cerebrospinal fluid. In a preferred embodiment, the 
present invention provides for a method of contacting said antibody and specifically binding it with 

30 ADEVAR. Further preferred is a method for using said antibody diagnostically to stratify seizures 
and thereby add value to therapeutic strategies. Further preferred is a method of diagnosis, 
wherein reduced ADEVAR level is associated with predisposition to seizure in a subset of patients 
manifesting seizure. 

Protein of SEQ ID NO:102 (Internal designation Clone 586587 J81-9-2-0-C5-F) 

35 The cDNA of Clone 586S87J81-9-2-0-C5-F (SEQ ID NO: 101) encodes hABC of SEQ 

ID NO: 1 02, comprising the amino acid sequence: 
MACWPQLRLLLWKNLTFRRR 
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PSAGTLPWVQGHCNANNPCFRYP^^ 

KDMRKVLRTLQQIKKSSSRGDKRHFLNWQKGLKPLPQ Accordingly, it will be 
appreciated that all characteristics and uses of the polypeptides of SEQ ID NO: 102 described 
throughout the present application also pertain to the polypeptides encoded by the nucleic acids 

5 included in Clone 586587_181-9-2-0-C5-F. In addition, it will be appreciated that all 

characteristics and uses of the polynucleotides of SEQ ID NOs:101 described throughout the 
present application also pertain to the nucleic acids included in Clone 586587_181-9-2-0-C5-F. A 
preferred embodiment of the invention is directed toward the compositions of SEQ ID NO: 102, 
SEQ ID NO: 101, and Clone 586587J81-9-2-0-C5-F. Also preferred are polypeptide fragments 

10 having a biological activity as described herein and the polynucleotides encoding the fragments. 
hABC is a novel splice variant of the ATP-binding cassette 1 . As a splice variant, hABC is only 
162 amino-acid long whereas ABCA1 is 2261 amino acid long. hABC displays 100% identity 
with ABCA1 over its 140 amino-terminal residues, whereas the 22 carboxyl-tenninal amino acids 
are unique to hABC. hABC does not display the Walker A and B motifs nor the active transport 

15 signature. The 140 common amino acids correspond to the cytoplasmic amino-terminal tail of 
ABCA1 that plays a role in cholesterol-binding. Furthermore, hABC displays one transmembrane ' 
domain (TCQLLLEVAWPLFIFLILISV) and a "positively drophobic-polar" signal peptide that is 
required for translocation to the plasma membrane. Moreover, the hABC splice variant is 
specifically expressed in liver cells. Thus, hABC plays an important role in clearing HDL from the 

20 bloodstream by binding to HDL-cholesterol, thus allowing HDL-cholesterol import to liver cells 
where lipids are catabolized and excreted. 

An embodiment of the invention is directed to a composition comprising a hABC 
polypeptide sequence of SEQ ID NO: 102. 

A further embodiment of the invention is directed to a composition comprising a hABC 

25 polypeptide fragment having biological activity. 

An embodiment of the invention is directed to a composition comprising a polynucleotide 
sequence of SEQ ID NO:101 encoding a hABC polypeptide. 

A further embodiment of the invention is directed to a composition comprising a 
polynucleotide sequence encoding a hABC polypeptide fragment having biological activity. 

30 An embodiment of the invention is directed to a composition comprising a polynucleotide 

sequence that yields an RNA that is complementary to the sequence of SEQ ID NO: 101 encoding a 
hABC polypeptide. 

A further embodiment of the invention is directed to a composition comprising a 
polynucleotide sequence that yields an RNA that is complementary to a polynucleotide sequence 
35 encoding a hABC polypeptide fragment. Preferred such a polynucleotide sequence is the 
polynucleotide sequence that yields an RNA that is complementary to 
GAGGGGACAAACGCCATTTCCTCAACTGGCAGAAGGGACTGAAGCCTCT 
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CCCTTITA. 

A further embodiment of the invention is directed to compositions comprising an antibody 
directed against a hABC polypeptide or against a hABC polypeptide fragment having the same 
biological activity. Preferably, the antibody specifically binds to the hABC polypeptide or and not 
5 to the ABCA1 polypeptide. Even more preferably, the antibody recognizes the 

LQQIKKSSSRGDKRHFL amino-acid sequence or the RHFLNWQKGLKPLP amino-acid 
sequence. 

An embodiment of the present invention relates to methods of measuring the circulating 
HDL-cholesterol in bodily fluids. Methods of detecting measuring the circulating HDL- 

10 cholesterol comprise the steps of i) labeling by standard methods of the hABC polypeptide with a 
molecule which can be used to provide a quantifiable signal, ii) addition of this probe, under 
conditions suitable for the formation of hybridization complexes, to a fluid obtained from a patient 
and to control fluids containing a known amount of HDL-cholesterol, iii) washing of the samples, 
after a suitable incubation period, in order to remove all hABC polypeptides that are not 

15 complexed with HDL-cholesterol, and iv) comparison of the resulting signal with control samples 
containing a known amount of HDL-cholesterol. Such methods can be used in diagnostic kits for 
detecting diseases associated with low circulating HDL-cholesterol level, for evaluating the 
efficacy of a particular therapeutic treatment regimen in animal studies and in clinical trials, and 
for monitoring the treatment of an individual patient. The binding efficiency of hABC to HDL- 

20 cholesterol can be determined using any technique familiar to those skilled in the art, e.g. using the 
assay described in U.S. Patent 5,962,322, which disclosure is hereby incorporated by reference in 
its entirety. 

An embodiment of the present invention relates to compositions comprising an antibody 
directed against hABC or fragment thereof, and to a method to decrease uptake of HDL-cholesterol 
25 comprising the step of inhibiting hABC binding to HDL-cholesterol using an anti-hABC antibody. 
Preferably, such compositions comprise the preferred antibodies described above. Such 
compositions can be administered to a cell, a tissue sample or a patient. Preferably, this method is 
directed to treating an individual with low circulating HDL-cholesterol level by decreasing HDL- 
cholesterol clearance. 

30 Another embodiment relates a method to decrease uptake of HDL-cholesterol comprising 

the step of inhibiting hABC expression without affecting ABCA1 expression using an antisense 
polynucleotide. In such a method, recombinant expression vectors comprising a polypeptide that 
yields an RNA that is complementary to the sequence of the hABC mRNA can be administered to 
cell, a tissue sample or a patient. Preferred such an antisense polynucleotide is described above. 

35 Preferred expression vectors include viral vectors, especially adenoviral and lentiviral vectors. 
Preferably, the antisense polynucleotides of the present invention are administered to hepatocytes. 
hi another embodiment, genetic modification of a cell with a vector comprising a 
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polynucleotide that yields an RNA that is complementary to the sequence of the hABC mKNA 
may be accomplished using one or more techniques well known in the gene therapy field. For 
example, one of the methods described in Mulligan (Mulligan, Science, 260:926-32 (1993)), which 
disclosure is hereby incorporated by reference in its entirety, can be used. Preferably, such a 

5 method is directed to treating an individual with low circulating HDL-cholesterol level by 
decreasing HDL-cholesterol clearance. 

A further embodiment of the present invention is directed to substances that decrease 
hABC expression without affecting ABCA1 expression, and to a method of screening for such 
substances comprising the steps of: i) contacting a cell with a test substance, ii) comparing hABC 

10 expression in the cell after exposure to that of an unexposed control cell, iii) comparing ABCA1 
expression in the cell after exposure to that of an unexposed control cell, iv) quantifying said 
expression levels, and v) determining the ratios of hABC and ABCA1 expression in an exposed 
cell relative to the expression in an unexposed cell. Preferably, hABC expression is studied in an 
hepatocyte and ABCA1 expression is studied in a macrophage. 

15 In another preferred embodiment, compositions comprising substances that decrease 

hABC expression without affecting ABCA1 expression can be administered to patients presenting 
low levels of HDL-cholesterol. 

Additionally, the compositions comprising an antibody directed against hABC or 
substances that decrease hABC expression without affecting ABCA1 expression can be processed 

20 in accordance with conventional methods of pharmacy to produce medicinal agents for 

administration to patients. For example, the pharmaceutical composition comprising an antibody 
directed against hABC substances that decrease hABC expression without affecting ABCA1 
expression may be made up in a solid form (e.g. granules for oral administration, powders for 
inhalation) or in a liquid form (e.g. solutions for oral administration or for injection). 

25 Effectiveness of the compositions can be verified in vivo by measuring the plasma HDL- 

cholesterol level of an animal model before and after administration of the composition of the 
present invention. The circulating HDL-cholesterol level can for example be measured using the 
fast pressure liquid chromatography technique as described in U.S. Patent 5,962,322, which 
disclosure is hereby incorporated by reference in its entirety. The dosage regimen for treating a 

30 human patient presenting low circulating HDL-cholesterol with compositions of the present 
invention may vary widely, but can be determined using standard methods. For example, the 
amount of antibody directed against or substances that decrease hABC expression without 
affecting ABC A I expression is an amount sufficient to increase circulating low HDL-cholesterol 
in the plasma of a subject. 

35 The compositions of the invention may be administered alone or in combination with other known 
agents increasing circulating HDL-cholesterol level, e.g., gemfibrozil, niacin and the SR-BI HDL- 
cholesterol receptor. Diseases associated with low circulating HDL-cholesterol level that may be 
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treated by compositions and methods of the present invention include, but are not limited to, 
artherosclerosis, angioplasty, dyslipidemia associated with non insulin-dependant diabetes 
mellitus, obesity and various other coronary artery diseases. 

Protein of SEQID NO:104 (Internal designation Clone 620315JL88-13-1-0-G12-F) 
5 The cDNA of Clone 620315_188-13-l-0-G12-F (SEQID NO: 103) encodes MOBP-81h 

of SEQ ID NO: 104, comprising the amino acid sequence: 
MSQKPAKEGPRLSKNQKYSEHFSHCC 

CCACQKTRLKRKDIPTPKKK. Accordingly, it will be appreciated that all characteristics and 
uses of the polypeptides of SEQ ID NO: 104 described throughout the present application also 

10 pertain to the polypeptides encoded by the nucleic acids included in Clone 620315_188-13-l-0- 
G12-F. In addition, it will be appreciated that all characteristics and uses of the polynucleotides of 
SEQ ID NOs:103 described throughout the present application also pertain to the nucleic acids 
included in Clone 620315_J88-13-l-0-G12-F. A preferred embodiment of the invention is 
directed toward the compositions of SEQ ID NO: 104, SEQ ID NO:103, and Clone 620315_188- 

15 13-1-0-G12-F. Also preferred are polypeptide fragments having a biological activity as described 
herein and the polynucleotides encoding the fragments. 

The protein of the present invention, named MOBP-8 Ih, is a novel splice variant of the 
myelin-associated oligodendrocyte basic protein (MOBP, Genbank accession number 
BAA05659). MOBP-8 lh is only 81 amino acids long, whereas MOBP is 183 amino acids long. 

20 The first exon is identical between the two cDNAs. MOBP-8 lh lacks the second exon of MOBP, 
and the twelve carboxyl-terminal amino acids of MOBP-8 lh are unique to this splice variant. 
MOBP-81h, is the first splice variant described for the human MOBP protein. MOBP-81h is 
specifically expressed in CNS oligodendrocytes, and plays a role in maintaining myelin sheath 
integrity. 

25 An embodiment of the invention is directed to a composition comprising a MOBP-81h 

polypeptide sequence of SEQ ID NO: 104. 

A further embodiment of the invention is directed to a composition comprising a MOBP- 
81h polypeptide fragment having biological activity. 

A further embodiment of the invention is directed to a composition comprising a 
30 polynucleotide sequence of SEQ ID NO:103 encoding a MOBP-81h polypeptide. 

A further embodiment of the invention is directed to a composition comprising a 
polynucleotide sequence encoding a MOBP-8 lh polypeptide fragment having biological activity. 

In another embodiment, the compositions of the present invention comprise MOBP-8 lh 
polypeptides. The method of producing MOBP-8 lh polypeptides comprises the steps of: i) 
35 transfecting a mammalian host cell with a recombinant expression vector comprising a 

polynucleotide of the present invention, and ii) purifying the produced protein. The purification of 
the protein can be done following any technique well-known to those skilled in the art. Preferably, 
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an antibody directed against MOBP-81h or fragment thereof may be bound to a chromatographic 
support to form an affinity chromatography column. Even more preferably, the antibody 
recognizes the twelve carboxyl-terminal amino acids of MOBP-81h. 

An embodiment of the present invention relates to methods of using the polypeptides and 
5 the polynucleotides of the present invention to treat or to reduce in severity demyelinating 
disorders. Any compositions and methods containing, e.g., MOBP-81h polypeptide or fragment 
thereof, a polynucleotide encoding the protein, or a compound that increases the expression or 
activity of MOBP-8 lh can be used. 

In an embodiment, the methods of the present invention relate to the administration of a 
10 recombinant expression vector comprising one of the polynucleotides of the invention to a patient 
suffering from a demyelinating disease. Preferred expression vectors include viral vectors, 
especially adenoviral and lentiviral vectors. 

In another embodiment, genetic modification of a cell with a vector comprising one of the 
polynucleotides of the invention may be accomplished using one or more techniques well known 
15 in the gene therapy field. For example, one of the methods described in Mulligan (Mulligan, 
Science, 260:926-32 (1993)), which disclosure is hereby incorporated by reference in its entirety, 
can be used. 

In still another embodiment, the compositions of the present invention comprise a 
- substance that increases MOBP-8 lh expression. 

20 Additionally, the methods of the present invention relate to methods of screening test 

substances that increase MOBP-81h expression. These methods comprise the steps of: i) 
: contacting a cell with a test substance; and ii) comparing MOBP-81h expression in the cell after 
exposure to the test substance to that of an unexposed control cell. Preferably, the test substance 
modifies the expression of MOBP-8 lh in oligodendrocytes while not in other cell types. 

25 Effectiveness of compositions and methods of the present invention to treat demyelinating 

diseases can be verified in vitro by studying the effects of the compositions of the present 
invention on the morphology of myelin sheaths by immunoelectron microscopy. Effectiveness of 
compositions and methods of the present invention to treat demyelinating diseases can be verified 
in vivo using experimental models of demyelinating disorders, e.g., TMEV-infected mice. 

30 Effective doses of the polypeptides or polynucleotides of the present invention for treating a 
patient suffering from demyelinating disorders can be determined according to the relevant 
techniques. For example, the effective amounts of compositions of the present invention can be 
determined by measuring the necessary and sufficient amount of composition for disappearance or 
reduction in severity of clinical manifestations associated with demyelinating disorders (e.g. 

35 tremor, tonic seizure, unstable locomotion, ataxia). 

hi a preferred embodiment, MOBP-8 lh polypeptides or a substance that increases MOBP- 
81h expression can be processed in accordance with conventional methods of pharmacy to produce 
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medicinal agents for administration to patients. Thus, the pharmaceutical composition comprising 

MOBP-81h or fragment thereof or a substance that increases MOBP-81h expression may be made 

up in a solid form (e.g. granules for oral administration, powders for inhalation) or in a liquid form 

(e.g. solutions for oral administration or for injection). 
5 The compositions of the invention may be administered alone or in combination with other known 

agents treating demyelinating disorders, e.g., imidazol derivatives or MBP molecules. 

Demyelinating disorders that may be treated by a composition containing MOBP-8 Ih or fragment 

thereof include but are not limited to leukodystrophies (e.g. Krabbe's disease, metachromatic 

leukodistrophy, ALD, Canavan disease, Alexander disease), leukoencephalopathies, multiple 
10 sclerosis and virus-induced inflammatory demyelination. 

Protein of SEQ ID NO: 106 (Internal designation Clone 646477_181-19-2-0-F4-F) 
The cDNA of Clone 646477_181-19-2-0-F4-F (SEQ JD NO: 105) encodes novel 

Apolipoprotein H (NAPOH) of SEQ ID NO:106, comprising the amino acid sequence: 

MISPVLILFSSFLCHVAIAGR^ 
15 FICPLTGLWLINTLKC^ 

EEGKWSPELPVCAPHCPPPSIPTFATLRVYKPSAGNNSLYKDTAVFE 

CITHGNWTBGLPECREVKCTFPSRPDNGFVNYP 

CTKXGNWSAMPSCKASCKWVK^ 

KCSYTEDAQCIDGTIEWKCFKEHSSLAFWKTDASDVKPC. Accordingly, it will be 

20 appreciated that all characteristics and uses of the polypeptides of SEQ ID NO: 106 described 
throughout the present application also pertain to the polypeptides encoded by the nucleic acids 
included in Clone 646477_181-19-2-0-F4-F. In addition, it will be appreciated that all 
characteristics and uses of the polynucleotides of SEQ ID NO: 105 described throughout the 
present application also pertain to the nucleic acids included in Clone 646477_181-19-2-0-F4-F. A 

25 preferred embodiment of the invention is directed toward the compositions of SEQ ID NO: 105, 
SEQ ID NO: 106, and Clone 646477_1 81-1 9-2-0-F4-F. Also preferred are polypeptide fragments 
. having a biological activity as described herein and the polynucleotides encoding the fragments. 

The protein of SEQ ID NO: 106 is a polymorphic variant of the sequence of apolipoprotein 
H or beta-2-glycoprotein I precursor (swissprot accession numberP02749). Like apoliprotein H, 

30. the protein of the invention displays 4 Sushi domains (PF00084) and one sushi-like domain, from 
amino acids 23 to 79 (Sushi 1), amino acids 84 to 137 (Sushi 2), amino acids 142 to 200 (Sushi 3), 
. amino acids 205 to 260 (Sushi 4) and amino acids 263 to 345 (Sushi-like). Sushi domains are also 
known as Complement control protein (CCP) modules, or short consensus repeats (SCR), exist in a 
wide variety of complement and adhesion proteins. Also, it has been reported that the domain V 

35 (sushi-like domain) specifically interacts with hydrophobic ligands (Hong, D.P. et al., 

Biochemistry 40:8092-8100 (2001)). Novel apolipoprotein H, the protein of SEQ ID NO:106, is 
highly expressed in liver. 
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Novel apolipoprotein H is a plasma protein with the ability to bind with various kinds of 
negatively charged substances. Novel apolipoprotein H (NAPOH) may prevent activation of the 
intrinsic blood coagulation cascade by binding to phospholipids on the surface of damaged cells. 
NAPOH is a strong auto-antigen that stimulates a vigorous B cell-humoral response and T cell 

5 immunity response. NAPOH has been implicated in a variety of physiologic pathways including 
lipoprotein metabolism, artherosclerosis and in the production of antiphospholipid autoantibodies 
("aPA"). NAPOH also binds to platelets, mitochondria, heparin, DNA, and anionic phospholipids, 
and has been shown to be involved in the blood coagulation pathway, platelet aggregation, and 
prothrombinase acitvity of platelets. NAPOH exerts multiple inhibitory effects on the coagulation 

10 pathway and platelet aggregation. NAPOH is considered to be a required cofactor for anionic 
phospholipids antigen by the aPA found in sera of many patients with chronic inflammatory 
disease, like systemic lupus erythematosus, and primary antiphospholipid syndrome, but it does not 
seem to be required for the reactivity of aPA associated with infections. These studies suggest that 
the NAPOH-phospholipid compex forms the antigen to which aPA are directed. Autoantibodies to 

1 5 phospholipid-free NAPOH are present in patients with primary antiphospolipid syndrome. 
Antiphospholipid autoantibodies are a heterogeneous group of autoantibodies including most 
commonly a lupus anticoagulant and anticardiolipin antibodies which are directed against 
negatively charged phospholipids. The presence of antiphospholipid autoantibodies has been 
associated with recurrent deep vein thrombosis and other thrombotic complications, including 

20 pulmonary, renal, and retinal thrombosis, as well as Budd-Chiari syndrome. In addition, 

antiphospholipid autoantibodies have been associated with arterial thrombosis including cerebral, 
retinal, and peripheral arteries. Recurrent fetal losses, usually occurring in the second and third 
trimester, felt to be due in part to thrombosis of the placental vessels and subsequent infarction 
resulting in placental insufficiency and ultimately fetal loss are associated with antiphospholipid 

25 autoantibodies. 

An embodiment of the invention is directed to a composition comprising a novel 
Apolipoprotein H (NAPOH) polypeptide sequence of SEQ ID NO: 106. 

A further embodiment of the invention is directed to a composition comprising a NAPOH 
polypeptide fragment having biological activity. 
30 A further embodiment of the invention is directed to a composition comprising a 

polynucleotide sequence of SEQ ID NO: 105 encoding a NAPOH polypeptide. 

A further embodiment of the invention is directed to a composition comprising a 
polynucleotide sequence encoding a NAPOH polypeptide fragment having biological activity. 

Preparation and purification of the protein of SEQ ID NO: 106 or fragments thereof may be 
35 carried out as described in Patent US-5,859,2 1 3, the disclosure of which is incorporated herein by 
reference in its entirety. For example, a method of purifying NAPOH from human blood plasma 
comprising the steps: i) heating and cooling the plasma to obtain a precipitate and a supernatant, ii) 
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separating the supernatant and acidifying the supernatant, iii) adding a precipitation agent to the 
supernatant and separating aqueous albumin solution from second precipitate, iv) subjecting the 
aqueous albumin solution to affinity chromatography; and v) eluting the particulate support to 
obtain NAPOH. 

5 A further embodiment of the invention is directed to a method of screening test substances 

for activators or inhibitors of NAPOH expression comprising the steps of: i) contacting a cell with 
a test substance; and ii) comparing NAPOH expression in the cell after exposure to the test 
substance to that of an unexposed control cell As a result, such NAPOH activators are of great 
potential as new drugs due to their ability to induce coagulation and are expected to be useful in 

10 treatment of various coagulation disorders (including but not limited to hereditary disorders, such 
as hemophilias and disseminated intravascular coagulation, a severe hemorragic syndrome) or to 
enhance coagulation and other hemostatic events in treating wounds resulting from trauma, surgery 
or other causes. Alternatively, such NAPOH inhibitors can be useful in treatment of autoimmune 
diseases and thrombotic diseases. 

1 5 A further embodiment of the invention is directed to a method of screening for test 

substances that specifically bind to NAPOH and prevent binding to antiphospholipid 
autoantibodies to comprising the steps of: i) contacting a test substance with NAPOH polypeptide 
in the presence of antiphospholipid autoantibodies, under conditions that allow binding of NAPOH 
to antiphospholipid autoantibodies and ii) detecting the amount of antiphospholipid autoantibodies 

20 bound to NAPOH in the presence and absence of the test substance by methods common to the art. 
Preferably, the test substance is able to inhibit NAPOH interaction with antiphospholipid 
autoantibodies. Interaction of NAPOH with autoantibodies is linked to antiphopoholipid syndrome 
and more specifically to autoimmune artherogenesis. 

A further embodiment of the invention is directed to a method of screening substances for 

25 modulators of NAPOH expression comprising the steps of: i) contacting a cell with a test 

substance; and ii) comparing NAPOH expression in the cell after exposure to the test substance to 
that of an unexposed control cell. NAPOH expression is determined by methods common to the art 
or included herein, by detecting NAPOH polynucleotides or polypeptides. An example of this 
method comprises the step of: i) culturing two equivalent cell samples; ii) adding a test substances 

30 to one of the cultures and not the other; iii) harvesting both cultures at a specified time; iv) 

purifying the mRNA from each sample of cells; v) comparing the level of NAPOH mRNA in each 
sample by Northern Blot, RTPCR, or another method common to the art. The invention provides 
for design and use of specific polynucleotide probes and primers, as described herein. An 
additional example comprises the step of: i) having two equivalent cultures of cells; ii) adding a 

35 test substance to one of the cultures and not the other; iii) harvesting both cultures; iv) purifying 
the protein from each sample of cells; v) comparing the level of NAPOH polypeptides in each 
sample by Western blot, immunohistochemistry, or another method common to the art. The 
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invention provides for design and use of specific antibodies and antibody fragments, as discussed 
herein. As a result, such NAPOH activators are of great potential as new drugs due to their ability 
to induce coagulation and are expected to be useful in treatment of various coagulation disorders 
(including but not limited to hereditary disorders, such as hemophilias and disseminated 
5 intravascular coagulation, a severe hemorragic syndrome) or to enhance coagulation and other 
hemostatic events in treating wounds resulting from trauma, surgery or other causes. 
Alternatively, such NAPOH inhibitors can be useful in treatment of autoimmune diseases and 
thrombotic diseases by decreasing the level of inflammatory aPAs. In another embodiment, the 
invention relates to methods for vising the protein of the invention or fragments to identify 

10 autoantibodies which are related to autoimmune disease and systemic lupus erythematosus (SLE). 
Accordingly, the present protein may be used to detect the presence of autoantibodies. In a typical 
embodiment, the protein of SEQ ID NO: 106 is labeled with any detectable moiety including, but 
are not limited to, a fluorescent label, a radioactive atom, a paramagnetic ion, biotin, a 
chemiluminescent label or a label which can be detected through a secondary enzymatic or binding 

15 step. The invention further provides a method of diagnosing SLE, and distinguishing such 
processes from other diseases. 

An antagonist of the protein of SEQ ID NO: 106 may be produced using methods which are 
generally know in the art. The antagonist will affect the binding activity of NAPOH to negatively 
charged phopholipids which are implicated in autoimmune disorders. In one aspect, the protein of 

20 the invention or a fragment thereof may be used to synthesize specific antibodies using any 
techniques known to those skilled in the art including those described therein. In particular, 
purified NAPOH may be used to produce antibodies or to screen libraries of pharmaceutical agents 
to identify those which specifically bind NAPOH. NAPOH may thus be used for the 
characterization and assay of antibody against this protein in patients suffering from autoimmune 

25 disorder. 

The ability of the protein of the invention or fragment thereof to function as a major 
antigen for antiphospholipid antibodies may be assessed using techniques well known to those 
skilled in the art. The ability of the protein of the invention or fragment thereof, especially 
fragments containing Sushi motifs and Sushi-like motifs, to bind to antiphospholipid 

30 autoantibodies may be assessed using techniques well known to those skilled in the art including 
those described herein. For example, the protein of SEQ ID NO: 1 06 or a fragment thereof may be 
fixed to a solid support, such as a chromatograpy matrix. A preparation containing 
antiphopholipid autoantibodies is placed in contact with the protein of the invention under 
conditions which facilitate binding to NAPOH. The support is washed and then the 

35 antiphopholipid autoantibodies are released from the support by contacting the support with agents 
which cause antiphopholipid autoantibodies to dissociate from the NAPOH. 

An embodiment of the present invention relates to methods of using the protein of the 
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invention or fragment thereof, particularly polypeptides containing Sushi motifs, or derivative 
thereof to identify and/or quantify binding autoantibodies, preferably anti phospholipid 
autoantibodies, in a biological sample, and thus used in assays and diagnostic kits for the 
quantification of such binding proteins in bodily fluids, in tissue samples, and in mammalian cell 
5 cultures. Such assays may be particularly useful as diagnostic or prognostic tools in the detection 
and monitoring of a disorder linked to primary antiphospholipid syndrome. The binding activity of 
the protein of the invention or fragment thereof may be assessed using any method familiar to 
those skilled in the art. Preferably, a defined quantity of the protein of the invention or fragment 
thereof is added to the sample under conditions allowing the formation of a complex between the 

1 0 protein of the invention or fragment thereof and the binding protein to be identified and/or 
quantified. Then, the presence of the complex and/or or the free protein of the invention or 
fragment thereof is assayed and eventually compared to a control using any of the techniques 
known by those skilled in the art. 

In another embodiment, an array of oligonucleotides probes comprising the nucleotide 

15 sequence of SEQ ID NO:105 or fragments thereof can be constructed to conduct efficient 

screening of e.g., genetic mutations. The microarray can be used to monitor the expression level of 
large numbers of genes simultaneously and to identify genetic variants, mutations, and 
polymorphisms. This information may be used to determine gene function, to understand the 
genetic basis of a disorder, to diagnose a disorder, and to develop and monitor the activities of 

20 therapeutic agents (see for example: Chee, M. et al., Science, 274:610-614 (1 996) which disclosure 
is hereby incorporated by reference in its entirety). For example, it has been shown that genetic 
variants, mutations, and polymorphisms are related to thrombotic related disease and chronic 
inflammatory disease (described in U.S. Pat. Nos: 6,203,980 Bl, the disclosure of which is 
incorporated herein by reference in its entirety). 

25 In addition, NAPOH is involved in the fertilization process. The addition of the purified protein to 
, prepared sperm samples from normospermic men increases significantly the straight line velocity 
(VSL) and the amplitude of lateral head displacement (ALH). Storage of sperm is of widespread 
importance in commercial animal breeding programs, human sperm donor programs, and in the 
treatment of certain disease states. For example, sperm samples may be frozen for men who have 

30 been diagnosed with cancer or other diseases that may eventually interfere with sperm production, 
as well as for assisted reproduction purposes where sperm may be stored for use at other locations 
or times. The procedures utilized in such cases include: washing a sperm sample to separate out 
the sperm-rich fraction from non-sperm components of a sample such as seminal plasma or debris; 
further isolating the healthy, motile sperm from dead sperm or from white blood cells in an 

35 ejaculate; freezing or refrigerating of sperm for use at a later date or for shipping to females at 
differing locations; extending or diluting sperm for culture in diagnostic testing or for use in 
therapeutic interventions such as in vitro fertilization or intracytoplasmic sperm injection (Cohen et 
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al. 12: 994-1001 (1997)). Once sperm have been washed or isolated, they are then extended (or 
diluted) in culture or holding media for a variety of uses (sperm analysis, diagnostic tests, assisted 
reproduction). Each of these uses for extended or diluted sperm requires a somewhat different 
formulation of basal medium (see, for review, US Patent No. 6,140,121 Ellington et al. Oct. 2000); 
5 however, in all cases sperm survival is suboptimal outside of the female reproductive tract. Novel 
additional components of a dilution or storage medium which could improve the functional 
preservation of sperm would be useful. Therefore, in another preferred embodiment of this 
invention, purified recombinant proteins encoded by SEQ ID NO: 106 or fragments thereof can be 
added as components of pharmacological media designed to protect spermatozoa. The methods 

10 used to compose such preservation media are generally known by those skilled in the art (for 
example, Oliver S.A., et al. US patent 5,897,987 Apr.1999; Cohen J. et al., supra). Inversely, in 
yet another embodiment of this invention, ligands, inhibitors, neutralizing antibodies or other 
biological agents which recognize the protein of the invention and which bind it and which block it 
can be used as components of pharmacological formulations designed for male contraception 

15 purposes. 

Protein of SEQ ID NO:108 (Internal designation Clone 113165 _1 05-056-3-0-G12-F) 

The cDNA of clone 1 13 1 65 (SEQ ID NO: 107) encodes the protein of SEQ ID NO: 108, 

comprising the amino acid sequence: 

MAAGGSGVGGKRSSKSDADSGFLGLRPTSVDPALft 
20 PLGLEVDQFLEDVRLQERTSGGLLSEAPNEKLFFVOT 

PLRVDLILENTSKWAPKDVLAH^ 

NPSATRAKJPGPQDWERPFYDLW^ 

QAPAWVAPAGASYNPSFEDHQTLLSAAHEV 

FQELCEGLLEESDGEGEPGQGEGPEAGDAEVCPTPARLATT^ 
25 QQAALRAARLRHQELFRLRGKAQV^ 

APDroVQLSSELTDSLRTLKPEGNILRDRFK^ 

REIQL. Accordingly, it will be appreciated that all characteristics and uses of polypeptides of 
SEQ ID NO: 108 described throughout the present application also pertain to the polypeptides 
encoded by the nucleic acids included in clone 1 13 165_105-056-3-0-G12-F. In addition, it will be 

30 appreciated that all characteristics and uses of the polynucleotides of SEQ ID NO: 107 described 
throughout the present application also pertain to the nucleic acids included in clone 1 13165_105- 
056-3-0-G12-F. A preferred embodiment of the invention is directed toward the compositions of 
SEQ IDNO:107, SEQ IDNO:108, and clone 113165_105-056-3-0-G12-F. Also preferred are 
polypeptide fragments having a biological activity as described herein and the polynucleotides 

35 encoding the fragments. 

The cDNA of SEQ ID NO: 107 is a novel human JNK3-binding protein named hJNK3-BP, 
homologous to a murine JNK3-binding protein (GSP:AAB 12882). The cDNA of SEQ ID NO:107 
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encodes a 478 amino-acid protein of SEQ ID NO: 1 08, which is predominantly expressed in the 
brain. 

The c-Jun NH2-terminal kinase (JNK) signal transduction pathway is activated in response 
to various environmental stress and by the engagement of several classes of cell surface receptors. 
5 In mammalian cells, JNK has been implicated in the immune response, oncogenic transformation 
and apoptosis. These effects of JNK are mediated, at least in part, by increased gene expression. 
Three mammalian genes encode JNK protein kinases. JNK1 and JNK2 are expressed ubiquitously, 
while JNK3 is expressed primarily in the brain (Ip YT, Davis RJ Curr Opin Cell Biol 1998 
Apr;10(2):205-19). By performing a yeast two-hybrid screen specifically with JNK3 as a bait, Ito 

10 M, et al (Ito M, Yoshioka K, Akechi M, Yamashita S, Takamatsu N, Sugiyama K, Hibi M, 
Nakabeppu Y, Shiba T, Yamamoto KI. Mol Cell Biol 1999 Nov;19(l l):7539-48) isolated mouse 
Jsapl (for JNK/stress-activated protein kinase-associated protein 1), also known as Jip-3 (Kelkar 
N, Gupta S, Dickens M, Davis RJ, Mol Cell Bio 2000 20:1030-1043). Jip-3 represents a JNK- 
interacting proteins (JIPs), such as Jip-1 and Jip-2, acting as scaffolding proteins that may regulate 

15 signal transduction by the JNK signaling pathway. The protein of the invention hJNK3-BP 
specifically binds JNK3 protein kinase, modulating the biological effects of JNK3 signaling 
pathway in cells. UNK3-BP represents the founding member of a new class of scaffold protein 
involved in the regulation of the JNK3 cascade. 

An embodiment of the invention is directed to a composition comprising a hJNK3-BP 

20 polypeptide sequence of SEQ ID NO: 108. 

A further embodiment of the invention is directed to a composition comprising a hJNK3- 
BP polypeptide fragment having biological activity. 

A further embodiment of the invention is directed to a composition comprising a 
polynucleotide sequence of SEQ ID NO: 107 encoding a WNK3-BP polypeptide. 

25 A further embodiment of the invention is directed to a composition comprising a 

polynucleotide sequence encoding a 1JNK3-BP polypeptide fragment having biological activity. 

In one embodiment, the present invention provides a method of producing a recombinant 
protein capable of effectively modulating JNK activity. The protein of the invention can be 
produced in host cells that have been transfected with an appropriate expression vector cdmprising 

30 a nucleic acid sequence coding for UNK3-BP polypeptides. Introduction of an expression vector 
incorporating a nucleic acid sequence coding for the protein of the invention into a host cell can be 
performed in a variety of ways, including but not limited to calcium or lithium chloride treatment, 
electroporation, or lipofection. Any of a wide variety of expression systems can be used to 
provide the recombinant proteins. Suitable expression vehicles include, but are not limited to 

35 plasmids, viral particles and baculo virus for insect cells. The expression vehicle can be integrated 
into the host cell genome. In some circumstances, it is desirable to employ an inducible expression 
vector. The host cells harboring the expression vehicle are cultured in conventional nutrient 



310 



WO 02/094864 



PCT/IB01/01715 



media, under conditions whereby the nucleic acid sequence coding for this particular protein is 
expressed. After a suitable amount of time for the product to accumulate, the protein is purified 
from the host cells. 

In another embodiment, the present invention provides a method of effectively modulating 
5 INK activity in cells. The level or activity of hJNK3-BP can be increased in cells to decrease or 
inhibit specific JNK protein kinase activity, thereby preventing JNK3 -associated apoptosis. 
hJNK3-BP levels may be increased by introducing UNK3-BP polynucleotides or polypeptides into 
a cell in an amount sufficient to specifically inhibit JNK protein kinase activity of one or more 
cells within the sample. Such methods can be performed either in vitro or in vivo. The level of 

10 WNK3-BP can be increased in cells in any of a number of ways. For instance, purified hJNK3-BP 
protein may be introduced to the cells by microinjection or by liposome or micelle-mediated 
transport. Such liposomal or micellar microcapsule may optionally be combined with a cell type- 
specific target, such as an antibody or receptor ligand. Alternatively, hJNK3-BP polynucleotides 
may be introduced to a cell by methods common to the art such as transfection, electroporation, or 

15 viral transduction. Cyclodextrin, liposome or micelle-mediated transport may also be used to 
introduce MNK3-BP polynucleotides to a cell. Useful examples of the above methods are 
described in U.S. Patents 5019369, 5616565, 61 10490, 6204060, and P.C.T. WO9704748, 
disclosures of which are hereby incorporated by reference in their entireties, hi addition, any 
compound that increases the expression of WNK3-BP polypeptides can be used to decrease JNK 

20 protein kinase activity within one or more cells of the sample. Such compounds can be identified 
by screening for test substances that increase hJNK3-BP expression comprising the steps of: 
contacting a cell with a test substance and comparing hJNK3-BP expression in the cell after 
exposure to the test substance to that of an unexposed control cell. 

The present invention provides an in vitro method to inhibit apoptosis induced by JNK 

25 activation to keep cells alive in culture. Preferably, the present invention is suited to the culturing 
of cells for purposes including transplantation or implantation of such cells in vivo after an ex vivo 
introduction of hJNK3-BP polynucleotides. Said polynucleotides may be introduced to a cell of 
interest by methods known in the art, such as those listed above. Furthermore, such a method can 
be used with neurons or other cell types which undergo apoptosis in culture. Transplantation of 

30 healthy neurons expressing a hJNK3-BP into subjects whose neurons are degenerating can 
alleviate some effects of the neuronal diseases or disorders. Treated cells can be grafted, in 
particular, into the brain, either as cells cultured in vitro on a support matrix using techniques 
disclosed in the US Patent 6,264,943, which disclosure is hereby incorporated by reference in its 
entirety, or as dispersed cells. 

35 The present invention also provides animal models generated by modulating the expression 

or activity of the present protein in one or more tissues of the animal. Preferably, the expression of 
hJNK3-BP polypeptides is targeted in the brain. The transgene can be integrated as a single 
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transgene or in concatamers, e.g., head-to-head tandems or head-to-tail tandems. The transgene 
can also be selectively introduced into and activated in a particular cell type using a conditional 
expression system. Such animals are useful for a number of purposes, because they represent an in 
vivo assay method for testing candidate molecules potentially useful for the treatment of various 

5 pathophysiological aspects of diseases specifically related to the activity or consequence of the 
activity of hJNK3-BP polypeptides on JNK biological effects. Study of the phenotype of such 
models can also allow the identification of additional human diseases associated with JNK 
abnormal activity. These animals can be generated with any method of targeting overexpression or 
inactivation of hJNK3-BP to produce the founder lines of transgenic animals. Such models are 

10 extremely useful, e.g. in the assessment of candidate therapies and drugs for the treatment of 
neurodegenerative diseases and autoimmune or malignancy conditions. 

In other embodiment, the protein of the invention or fragment thereof is used to diagnose 
diseases or disorders associated with abnormal hJNK3-BP activity and in particular with altered 
JNK biological effects. In particular, it is useful in diagnosing patients with deficient amounts of 

15 hJNK3-BP which results in uncontrolled activity of JNK protein kinase and monitor hJNK3-BP 
expression in such conditions. Preferably, the present invention provides a method of diagnose 
pathologies linked to altered apoptosis or inflammatory responses such as, but are not limited to, 
neurodegenerative diseases characterized by apoptosis, including Parkinson's disease and 
Alzheimer's disease, autoimmune diseases such as arthritis or other conditions characterized by 

20 inflammation and malignancies such as leukemias. The method comprises the steps of contacting 
a tissue sample obtained from an individual suspected of suffering from the disease or condition or 
at risk of developing the disease or condition, with a detectably labeled compound capable of 
selectively binding hJNK3-BP polypeptides or nucleic acids. For example, a polyclonal or 
monoclonal antibody or any immunologically active fragment thereof or a nucleic acid probe may 

25 be used. 

This marker may thus also play a role as prognostic indicators, preferably concerning 
inflammatory diseases. More preferably, it can be measured in tissues and fluids recovered from 
inflammatory sites. Thus, the condition of a subject can be monitored continuously and the 
quantified amount of this particular protein measured in the pathological sample can be compared 
30 with the amount quantified in a biological sample of a normal individual or with previous samples 
of the same patient 

A further embodiment of the present invention is to provide novel methods and 
compositions useful for the treatment or prevention of diseases and conditions related to the 
abnormal JNK biological effects and preferably with abnormal apoptosis. The protein of the 
35 invention or fragment thereof may be used to treat neurodegenerative diseases characterized by 
apoptosis, including Parkinson's disease and Alzheimer's disease. Other conditions that can be 
treated using the compositions and methods of the invention are autoimmune diseases such as 
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arthritis, other conditions characterized by inflammation such as inflammatory arthritis and 
bronchial asthma, and malignancies such as, but not limited to leukemias. 

In another embodiment of the present invention is to provide novel methods and 
compositions useful for the treatment or prevention of diseases and conditions associated with 
5 oxidative damage dependent on abnormal JNK biological effects. The protein of the invention or 
fragment thereof can be used to treat or prevent oxidative damage to organs such as the liver and 
kidney, and in particular, damage due to ischemia/reperfusion in heart disease and 
cardiomyopathy. More preferably, such methods and compositions can also be used to treat donor 
organs for transplantation. Indeed these organs are exposed to substantial environmental stress 
10 which can affect the normal functioning of the organs; effects of which can be blocked by JNK 
modulators such as hJNK3-BP. 

Such methods comprise the administration of a therapeutically-effective amount of hJNK3-BP 
polypeptides to mammals suffering from the disease or condition, where "effective amount" is 
meant a concentration of WNK3-BP polypeptides which is capable of modulating JNK biological 

1 5 effects. The compositions of the invention are preferably delivered to an individual in combination 
with a pharmaceutically acceptable carrier, such as a saline solution or other physiological buffer 
suitable for administration to a patient. The particular amount of the compositions of the invention 
that will be administered to the mammal for any particular condition will depend on the clinical 
condition of the patient, and other factors such as the weight, age, and route of delivery. Such 

20 composition can be administered by any suitable route. Alternatively, for treatment purposes, 
nucleic acids can be administered to the patient using any of the standard vectors and/or gene 
delivery methods known in the art. Suitable gene delivery systems include, but are not limited to 
liposomes, naked DNA and viral vectors. These compositions can comprise the protein of the 
invention, and, optionally, one or more other compounds of interest. Indeed, in this embodiment, 

25 the present invention find use in drug potentiation applications. This co-administration may be by 
simultaneous administration or by separate or sequential administrations. All of these components 
may be either obtained from natural sources or produced by recombinant genetic engineering 
techniques and/or chemical modification. 

Protein of SEQ ID NO:110 (Internal designation Clone 231462 JL17-065-1-0-G11-F) 

30 The cDNA of Clone 23 1462_1 17-065-1 -0-G1 1-F (SEQ ID NO:109) encodes the 386 amino acid 

long polypeptide, DROCK2, of SEQ ID NO:l 10 comprising the amino acid sequence: 

MCLLLSCPCHPSAHGQSMWIER^ 

NEKILMMTNQYQSDETLPINPLSMLLNGIVDPAW 

LTHIJCDLIAWQffFLGAGK 
35 DDRRVGRPRSMLRSYRQMSHSLASM 

TLPEVKLRRSKKRTKRSSV^ 

QMSFASQSMFITPALALSVAGIPGLDEAOT 
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LASKSAEEGKQPDSLSTDL. Accordingly, it will be appreciated that all characteristics and 
uses of polypeptides of SEQ ID NO:l 10 described throughout the present application also pertain 
to the polypeptides encoded by the nucleic acids included in Clone 231462 1 17-065-1-0-G1 1-F. In 
addition, it will be appreciated that all characteristics and uses of the polynucleotides of SEQ ID 

5 NO: 109 described throughout the present application also pertain to the nucleic acids included in 
Clone 231462 1 17-065-1-G-G1 1-F. A preferred embodiment of the invention is directed toward 
the compositions of SEQ ID NO: 109, SEQ ID NO:l 10, and Clone 231462 117-065-1-0-G11-F. 
Also preferred are polypeptide fragments having a biological activity as described herein and the 
polynucleotides encoding the fragments. 

10 DROCK2, the protein of SEQ ID NO: 1 10, is a splicing variant of DOCK2 (EMBL entry 

Q92608) retaining the last 370 amino acid of DOCK2, while the first sixteen amino acids 
(MCLLLSCPCHPSAHGQ) represent specificDROCK2 amino acids corresponding to signal 
sequence. The resulting isoform is thus lacking the N-terminal sequences of the DOCK2 isoform. 
However, it retains DOCK2's C-terminal domain comprising a twenty amino acid sequence 

15 (LASKSAEEGKQIPDSLSTDL) which has been shown to be involved in protein-protein 
interactions by interacting with PDZ domain of membrane-associated proteins. 

DROCK2 belongs with DOCK2 to the CDM family of signaling proteins which also 
comprises the human DOCK 180 protein and its homologues, the Ced-5 protein in Caenorhabditis 
elegans and Mbc polypeptide in Drosophila melanagaster. These proteins share extensive 

20 similarities at the amino acid level, except in their carboxyl-terminal regions that are divergent. 
CDM proteins have been implicated in polarized extension of the cell surface in their respective 
organisms. Mbc in Drosophila is necessary for myoblast fusion and for migration of epithelial 
cells, both of which require reorganization of the cytoskeleton. Ced-5 has also been shown to be 
involved in the regulation of the cytoskeleton in the nematode, loss of function of which results in 

25 defects in engulfing dead cells and in the migration of distal tip cells. Finally, the human 

DOCK180 protein which was originally identified as one of the major proteins bound to the Crkll 
adaptator protein, is involved in membrane ruffling and cell migration in nonadherent cells. It has 
been shown to transduce signals from the CrkII-pl30Cas complex to both the cytoskeleton and 
JNK pathway byactivating the low molecular weight Rac GTPase. 

30 DROCK2, in contrast to DOCK1 80, which is expressed in all tissues except in peripheral 

blood cells, is expressed only in circulating blood cells, lymphocytes and macrophages present in 
organs. Thus the protein is specifically expressed by nonadherent cells. DROCK2 is involved in 
blood cell migration and phagocytosis of apoptotic cells by macrophages where it binds to and 
activates Rac GTPAses. 

35 An embodiment of the invention is directed to a composition comprising a DROCK2 

polypeptide sequence of SEQ ID NO:l 10. 

A further embodiment of the invention is directed to a composition comprising a DROCK2 
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polypeptide fragment having biological activity. 

A further embodiment of the invention is directed to a composition comprising a 
polynucleotide sequence of SEQ ID NO: 109 encoding a DROCK2 polypeptide. 

A further embodiment of the invention is directed to a composition comprising a 
5 polynucleotide sequence encoding a DROCK2 polypeptide fragment having biological activity. 

A further embodiment of the invention is directed to a method of screening test substances 
for modulators of DROCK2 expression comprising the steps of: i) contacting a cell with a 
substance to be tested; and ii) comparing DROCK2 expression in the cell after exposure to the test 
substance to that of an untreated control cell. 
10 In one embodiment, the present invention provides a method of producing a recombinant 

protein capable of effectively increasing Rac GTPase activity. The protein of the invention can be 
produced in host cells that have been transfected with an appropriate expression vector comprising 
a nucleic acid sequence coding for the protein of the invention. Introduction into a host cell of 
such expression vector for DROCK2 can be performed in a variety of ways, including but not 
1 5 limited to calcium or lithium chloride treatment, electroporation, or lipofection. Any of a wide 
variety of expression systems can be used to provide the recombinant proteins. Suitable expression 
vehicles include, but are not limited to plasmids, viral particles or baculovirus for insect cells. The 
expression vehicle can be integrated into the host cell genome. Optionally, an inducible expression- 
vector can be used to achieve tight controlled expression of the gene in the host cell. 

Another embodiment the present invention provides methods to purify from cellular 
extracts proteins harboring one or more PDZ domain, preferably proteins belonging to the 
MAGUK (Membrane Associated and Guanylate Kinase) family, more preferably proteins selected 
from the group consisting of DLG, syntenin or PSD95 proteins, by using the present protein, 
preferably its C-terminal twenty amino acid sequence to copurify those proteins. Methods to 
affinity purify proteins are well known for those skilled in the ait. For example, the PDZ- 
containing proteins can be purified on an affinity column or on solid support like beads using the 
polypeptides of the invention. The protein to be purified using the present method can be derived 
from any source, e.g. protein expressed in vitro using an invertebrate, yeast or bacterial 
heterologous expression system. 

Another embodiment of the invention is directed to a method to increase phagocytosis of 
apoptotic cells. Preferably, this method is applied in vivo to an individual. The method comprises 
the steps of: i) removing a sample of monocytes, ii) introducing a polynucleotide encoding a 
DROCK-2 polypeptide or fragment thereof ex vivo to those cells, and iii) reinjecting the 
recombinant cells into an individual. Using such a method in combination with anticancer or 
antiviral therapies would be of particular interest for the rapid elimination of apoptotic cancer or 
infected cells. 

An embodiment of the invention provides for a method of screening test substances for 
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modulators of DROCK-2 expression. This method comprises the steps of: i) contacting a cell with 
a test substance; and ii) comparing DROCK-2 expression in the cell after exposure to the test 
substance to that of an unexposed control cell. DROCK-2 expression is determined by methods 
common to the art or included herein, by detecting DROCK-2 polynucleotides or polypeptides. 

5 An example of this method comprises the steps of: i) culturing two equivalent cell samples; ii) 
adding a test substance to one of the cultures and not the other; iii) harvesting both cultures at a 
specified time; iv) purifying the mRNA from each sample of cells; v) comparing the level of 
DROCK-2 mRNA in each sample by Northern blot, RTPCR, or another method common to the 
art. The invention provides for design and use of specific polynucleotide probes and primers, as 

10 discussed herein. An additional example comprises the steps of: i) having two equivalent cultures 
of cells; ii) adding a test substance to one of the cultures and not the other; iii) harvesting both 
cultures; iv) purifying the protein from each sample of cells; v) comparing the level of DROCK-2 
polypeptides in each sample by Western blot, immunohistochemistry, or another method common 
to the art. The invention provides for design and use of specific antibodies and antibody 

15 fragments, as discussed herein. Substances that increase DROCK-2 expression (agonists) may be 
used to increase cytoskeletal remodeling and Rac activation. Substances that decrease DROCK-2 
expression (antagonists) may be used to inhibit cytoskeletal remodeling and Rac activation. 
Methods utilizing DROCK-2 agonists and antagonists are included herein. 

A preferred embodiment of the invention provides a method of screening for test 

20 substances that bind DROCK-2 polypeptides. This method comprises the steps of: i) contacting a 
test substance with a DROCK-2 polypeptide or fragment thereof under conditions that allow 
binding; and ii) detecting the binding of the test substance by methods common to the art (e.g., 
competitive antibody-based methods such as coimmunopreciptation and Western blotting). 
Included in this method are test substances that are conjugated to an antibody, antibody fragment, 

25 cell-type specific ligand or a portion thereof. 

A further preferred embodiment of the invention provides a method of screening test 
substances that bind to DROCK-2 for antagonists of DROCK-2 activity. This method comprises 
the steps of: i) contacting a cell with a test substance; and ii) comparing DROCK-2 biological 
activity after exposure to the test substance to that of an unexposed control cell. Detection of 

30 DROCK-2 biological activity may be detected by detecting activity of Rac GTPase. An example 
of an assay detecting Rac activity comprises the steps of: exposing Rac GTPase to radiolabeled 
. GTP and detecting the amount of hydrolysis by detecting the amount of free, radiolabeled 
phosphate. 

A further embodiment of the present invention is also directed to a method to reduce the 
35 elimination rate of apoptotic cells in a patient subjected to an antiapoptotic treatment, such method 
comprising removing a sample of the monocytes/macrophages of said patient, inhibiting or 
reducing the expression of the present protein in the isolated cells ex vivo, and reinjecting the 
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modified cells to the patient. Methods to inhibit the expression of a given gene in a cell are well 
known in the art, e.g. using antisense or ribozyme strategies, any of which can be used in the 
present method. Alternatively, reduced phagocytosis of apoptotic cells in said patient can be 
achieved by interfering with the normal activity of the present protein. In such a method, the 
5 isolated monocytes are transfected ex vivo with a DROCK2 fragment corresponding to the last 
twenty carboxy-terminal amino acid prior to reinjection into the patient. Because reducing 
phagocytosis of apoptotic cells concomitantly with the administration of the antiapoptotic agent 
would help maintaining more dying cells alive and therefore available for the action of the 
antiapoptotic agent, such method would be of particular interest to increase the treatment 
10 efficiency of diseases associated with abnormal cell apoptosis, including but not limited to 
neurodegenerative disorders. 

A preferred embodiment provides a method of preventing and treating invasive neoplasms that 
require cytoskeletal remodeling (e.g., for extravasation). This method comprises the step of 
contacting an antagonist of DROCK-2 expression or activity with a cell. Preferred cells include 
. 15 nonadherent cells. Further preferred cells include lymphocytes and macrophages. Preferably, the 
DROCK-2 antagonist is delivered to a specific cell type, for example, by conjugating the 
antagonist to a cell-type specific targeting moiety (e.g., a ligand or antibody fragment). DROCK-2 
antagonists in a physiologically acceptable solution may be delivered by methods common to the 
art, such as orally or parenterally. This method is useful for prevention and treatment of leukemias 
20 and other invasive neoplasms. 

Protein of SEQ ID NO:112 (Internal designation Clone 500723589_205-34-3-0-G4-F) 
The cDNA of clone 500723589_205-34-3-0-G4-F (SEQ ID NO: 11 1) encodes Novel 17 beta- 
hydroxysteroid dehydrogenase type 2 ( NBHSD2) of SEQ ID NO: 112, comprising the amino acid 
sequence: 

25 MSTFFSDTAWICXAVPTVLCGT^ 
FSVSCFLMYTYLSGQELLPVDQ 

GAEELRRTCSPRLS VLQMDITKPVQIKDAYSKVAAMLQDRGLWA 

LLLMTDYKQCMAVNFFGTVEVTKTFLPLLRKSKGR^ 

AVTMFSSVMRLELSKWGIKVASIQPGGFLTNIAGTSDKWEKLE 

30 DY1IAQRNFLLLINSLASKDF 

YFAKKEffGQDKPMPRALRMPNYKKKAP. Accordingly, it will be appreciated that all 
characteristics and uses of the polypeptides of SEQ ID NO: 112 described throughout the present 
application also pertain to the polypeptides encoded by the nucleic acids included in Clone 
500723589_205-34-3-0-G4-F. In addition, it will be appreciated that all characteristics and uses of 

35 the polynucleotides of SEQ ID NO : 1 1 1 described throughout the present application also pertain to 
the nucleic acids included in Clone 500723589_205-34-3-0-G4-F. A preferred embodiment of the 
invention is directed toward the compositions of SEQ ID NO: 111, SEQ ID NO: 1 12, and clone 
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500723589__205-34-3-0-G4-F. Also preferred are polypeptide fragments having a biological 
activity as described herein and the polynucleotides encoding the fragments. 

The protein of SEQ ID NO: 1 12 is a polymorphic variant of the sequence of 17 beta 
estradiol dehydrogenase (swissprot accession number P37059). Like 17 beta-hydroxysteroid 

5 dehydrogenase type 2, the protein of the invention displays a short chain dehydrogenase domain 
(PF00106) spanning from positions 83 to 268, a ferredoxin domain (PS00197) spanning from 
positions 40 to 48 and an ADH-short domain spanning from positions 219 to 247. 

Novel 17 beta-hydroxysteroid dehydrogenase type 2 ( NBHSD2) is an enzyme of the 17 
beta-hydroxysteroid dehydrogenase (17 beta-HSD) gene family. The 17 beta-hydroxysteroid 

10 dehydrogenases are pivotal in controlling the biological potency of steroid hormones by catalyzing 
oxidation or reduction at position 17. 

1 7 Beta-hydroxysteroid dehydrogenases catalyze the interconversion between high-activity 
17beta-hydroxysteroids and low-activity 17-ketosteroids. Because both estrogens and androgens 
have the highest affinity towards their receptors in the 17 beta-hydroxy form, the 17 beta-HSD 

15 enzymes regulate the biological activity of sex hormones. Several 17beta-HSD may metabolize 
further substrates including alcohols, bile acids, fatty acids and retinol. The activities of 17 beta 
HSDs are essential for gonadal sex steroid biosynthesis and they are also involved in the 
modulation of steroid hormone action in peripheral tissues. This family of steroidogenic enzymes 
constitutes an interesting target in the control of the concentration of estrogens and androgens since 

20 this family is involved in the formation and inactivation of sex steroids. 

NBHSD2 catalyzes the oxidative reaction or the inactivation of sex steroids thereby 
reducing the exposure of tissues to the action of sex steroids. NBHSD2 preferentially catalyzes the 
oxidation of estradiol (E(2)) to inactive estrogen, estrone (E(l)), testosterone to 4-dione, 
dihydrotestosterone (DHT), 20alpha-dihydroprogesterone (20alpha-DHP), and androst-5-ene-3, 

25 17-diol (5-diol) to DHEA with NAD+ as the coenzyme. Therefore, NBHSD2 is involved in the 
regulation of clearance and/or metabolism of sex steroids. 

Local formation of sex steroids plays a major role in both normal and neoplastic hormone- 
sensitive tissues. 40% of all cancers, namely, breast, prostate, ovarian and uterine cancers, are sex 
steroid-sensitive and are thus prime candidates for approaches based upon the control of synthesis 

30 of active steroids in peripheral target tissues. Thus, the Tate of formation of each sex steroid 

depends upon the activity of the specific androgen- and estrogen-synthesizing enzymes in each cell 
of each tissue. Local hormone metabolism plays a key role in determining tissue responsiveness to 
. oestrogen. High capacity for inactivation of oestrogens is associated with the presence of 1 7beta- 
HSD isozymes in epithelial cells. By inactivating oestrogens, NBHSD2 plays a role in cancers, 

35 especially hormone-dependent cancers such as those stimulated by androgens or estrogens, for 
example, colon, breast, prostate, ovarian and uterine cancer. In the colon, NBHSD2 plays a role as 
attenuator of estradiol E2 bioavailability (estradiol (E2) stimulates the growth of colonic cancer 
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cell lines), and possibly as modulators of colonic cell proliferation in the pathogenesis of colon 
cancer. 

Also, bioavailibility of estradiol, one of the most potent human sex steroid hormones of 
placental origin, is essential to the maintenance of pregnancy, the timing of parturition, the 
5 maturation of many fetal organs, and the preparation of the maternal reproductive system. 

Several inhibitors of the functions of NBHSD2 have been characterized. These include: 
lindane which induces oxidative stress, progestins (promegestone, nomegestrol acetate, 
medrogestone) and tibolone and its metabolite which will provide a new possibility in the 
treatment of breast cancer, chalcones (naringenin chalcone and 4-hydroxychalcone), steroidal 
10 spirolactones inhibitors, isoflavones which have been suggested to be anticarcinogenic, 

propylthiouracil (PTU) which is an anti-thyroid drug. Such inhibitors are useful tools to regulate 
the level of active estrogens, androgens and progesterone and can exert cancer-preventive effects. 

Alternatively, retinoic acids stimulate the expression of NHBSD2 and may be involved in 
modulation of in situ estrogen metabolism in both normal and neoplastic human endometrium. 
15 An embodiment of the invention is directed to a composition comprising a NBHSD2 

polypeptide sequence of SEQ ID NO:l 12. 

A further embodiment of the invention is directed to a composition comprising a NBHSD2 
polypeptide fragment having biological activity. 

A further embodiment of the invention is directed to a composition comprising a 
20 polynucleotide sequence of SEQ ID NO: 111 encoding a NBHSD2 polypeptide. 

A further embodiment of the invention is directed to a composition comprising a 
polynucleotide sequence encoding a NBHSD2 polypeptide fragment having biological activity. 

An embodiment of the invention provides for a method of screening test substances for 
modulators of NBHSD2 expression. This method comprises the steps of: i) contacting a cell with a 
25. test substance; and ii) comparing NBHSD2 expression in the cell after exposure to the test 
substance to that of an unexposed control cell. NBHSD2 expression is determined by methods 
common to the art or included herein, by detecting NBHSD2 polynucleotides or polypeptides. An 
example of this method comprises the steps of: i) culturing two equivalent cell samples; ii) adding 
a test substance to one of the cultures and not the other; iii) harvesting both cultures at a specified 
30 time; iv) purifying the mRNA from each sample of cells; v) comparing the level of NBHSD2 
mRNA in each sample by Northern blot, RTPCR, or another method common to the art. The 
invention provides for design and use of specific polynucleotide probes and primers, as discussed 
herein. An additional example comprises the steps of: i) having two equivalent cultures of cells; ii) 
adding a test substance to one of the cultures and not the other, iii) harvesting both cultures; iv) 
35 purifying the protein from each sample of cells; v) comparing the level of NBHSD2 polypeptides 
in each sample by Western blot, immunohistochemistry, or another method common to the art. 
The invention provides for design and use of specific antibodies and antibody fragments, as 
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discussed herein. 

Agents which modulate the expression or activity of the NBHSD2 of the subject invention 
include, but are not limited to, antisense oligonucleotides, ribozymes, drugs, and antibodies. These 
agents may be made and used according to methods well known in the art. Also, the protein of the 
5 invention, or biologically active fragments thereof, may be used in screening assays for therapeutic 
compounds. A variety of drug screening techniques may be employed, hi this aspect of the 
invention, the protein or biologically active fragment thereof, may be free in solution, affixed to a 
solid support, recombinantly expressed on, or chemically attached to, a cell surface, or located 
intracellularly. The formation of binding complexes, between the protein of the invention, or 

10 biologically active fragments thereof, and the compound being tested, may then be measured. 
Another technique for drug screening which may be used provides for high throughput screening 
of compounds having suitable binding affinity to the protein of the invention as described in 
published PCT application WO84/03564, and incorporated herein by reference in its entirety. 
Another embodiment of the subject invention provides compositions and methods of 

15 selectively modulating the activity of the protein of the invention. Modulation of the NBHSD2 
activity would allow for the successful treatment and/or management of diseases or biochemical 
abnormalities associated with the NBHSD2. Antagonists, able to reduce or inhibit the expression 
or the activity of the protein of the invention, would be useful in the treatment of diseases 
associated with decreased estradiol and testosterone biosynthesis. For example, estradiol 

20 deficiency is an important pathogenetic factor in female osteoporosis. Also, antagonists of 
NBHSD2 provide methods of treating diseases including, and not limited to, cancers, especially 
hormone-dependent cancers such as those stimulated by androgens or estrogens. Andogen- 
sentitive diseases, i.e. diseases whose onset or progress is aided by androgeneic activity, are 
known, included but are not limited to prostate cancer, benign prostatic hyperplasia; acne, 

25 seborrhea, hirsutism, androgenic alopecia, precocious puberty, adrenal hyperplasia and polycystic 
ovarian syndrome. Estrogen sensitive diseases, i.e. diseases whose onset or progress is aided by 
estrogenic activity, included but are not limited to breast cancer, endometriosis, leiomyoma, and 
precocious puberty. 

Alternatively, the subject invention provides methods of treating diseases or disorders 
30 associated with decreased levels of the protein of the NBHSD2. Thus, agonists of NBHSD2 
provide methods for treating diseases with increases in estradiol and testosterone levels. 

In one embodiment, the subject method utilizes eukaryotic or prokaryotic host cells which 
are stably transformed with recombinant nucleic acids expressing the NBHSD2 polypeptide or 
biologically active fragments thereof. The transformed cells may be viable or fixed. Drugs or 
35 compounds which are candidates for the modulation of the NBHSD2, or biologically active 
fragments thereof, are screened against such transformed cells in binding assays well known to 
those skilled in the art. Alternatively, assays such as those taught in Geysen H. N., WO 
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Application 84/03564, published on Sep. 13, 1984, and incorporated herein by reference in its 
entirety, may be used to screen for peptide compounds which demonstrate binding affinity for, or 
the ability to modulate, the NBHSD2, or biologically active fragments thereof. In another 
embodiment, competitive drug screening assays using neutralizing antibodies specifically compete 
5 with a test compound for binding to the NBHSD2 protein of the invention, or biologically active 
fragments thereof. 

Agents which stimulate or inhibit the activity of the protein of the invention include but are 
not limited to agonist and antagonist drugs respectively. These drugs can be obtained using any of 
a variety of drug screening techniques as discussed above. 

1 0 Antagonists of the NBHSD2 polypeptide encoded by SEQ DD NO: 1 1 2 include agents 

which decrease the levels of expressed mRNA encoding the protein of SEQ ID NO: 1 1 2. These 
include, but are not limited to, RNAi, one or more ribozymes capable of digesting the protein of 
the invention mRNA, or antisense oligonucleotides capable of hybridizing to mRNA encoding the 
NBHSD2 polypeptide of SEQ ID NO: 112. Antisense oligonucleotides can be administrated as 

1 5 DNA, as DNA entrapped in proteoliposomes containing viral envelope receptor proteins [Kanoda, 
Y. et aL (1989) Science 243: 375, which disclosure is hereby incorporated by reference in its 
entirety] or as part of a vector which can be expressed in the target cell and provide antisense DNA 
orRNA. Vectors which are expressed in particular cell types are known in the art. Alternatively, 
the DNA can be injected along with a carrier. A carrier can be a protein such as a cytokine, for 

20 example interleukin 2, or polylysine-glycoprotein carriers. Carrier proteins, vectors, and methods 
of making and using polylysine carrier systems are known in the art. Alternatively, nucleic acid 
encoding antisense molecules may be coated onto gold beads and introduced into the skin with, for 
example, a gene gun [Ulmer, J.B. et al (1993) Science 259:1745, which disclosure is hereby 
incorporated by reference in its entirety]. 

25 Antibodies, or other polypeptides, capable of reducing or inhibiting the activity of 

NBHSD2 maybe provided as in isolated and substantially purified form. Alternatively, antibodies 
or other polypeptides capable of inhibiting or reducing the activity of the protein of the invention, 
may be recombinantly expressed in the target cell to provide a modulating effect. In addition, 
compounds which inhibit or reduce the activity of the protein of the subject invention may be 
* 30 incorporated into biodegradable polymers being implanted in the vicinity of where drug delivery is 
desired. For example, biodegradable polymers containing antagonists/agonists may be implanted 
to slowly release the compounds systemically. Biodegradable polymers, and their use, are known 
to those of skill in the art (see, for example, Brem et a/., J. Neurosurg. 74:441-446(1991) which 
disclosure is hereby incorporated by reference in its entirety). 

35 In one embodiment, methods of increasing the levels of NBHSD2 in tissues or cell types 

may be practiced by utilizing nucleic acids encoding the protein of the subject invention, or 
biologically active fragments thereof, to introduce biologically active polypeptide into targeted cell 
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types. Vectors useful in such methods are known to those skilled in the art as are methods of 
introducing such nucleic acids into target tissues. Preferred expression vectors include viral 
vectors, especially adenoviral and lentiviral vectors. For example, one of the methods described in 
Mulligan (Mulligan, Science, 260:926-32 (1993)), which disclosure is hereby incorporated by 
5 reference in its entirety, can be used. 

In another embodiment, the invention provides methods and compositions for detecting the 
level of expression of the mRNA of the protein of the invention. Quantification of mRNA levels 
of the NBHSD2 protein of the invention may be useful for the diagnosis or prognosis of diseases 
associated with an altered expression of the protein of the invention. Assays for the detection and 

1 0 quantification of the mRNA of the protein of the invention are well known in the art (see, for 
example, Maniatis, Fitsch and Sambrook, Molecular Cloning; A Laboratory Manual (1982), or 
Current Protocols in Molecular Biology, Ausubel, F.M. et al. (Eds), Wiley & Sons, Inc.). 

Polynucleotides probes or primers for the detection of NBHSD2 cDNA can be designed 
from the cDNA of SEQ ID NO: 111. Methods for designing probes and primers are known in the 

15 art. In another embodiment, the subject invention provides diagnostic kits for the detection of 
NBHSD2 cDNA in cells. The kit comprises a package having one or more containers of 
oligonucleotide primers for detection of NBHSD2 cDNA in PCR assays or one or more containers 
of polynucleotide probes for the detection of NBHSD2 cDNA by in situ hybridization or Northern 
analysis. Kits may, optionally, include containers of various reagents used in various hybridization 

20 assays. The kit may also, optionally, contain one or more of the following items: polymerization 
enzymes, buffers, instructions, controls, or detection labels. Kits may also, optionally, include 
containers of reagents mixed together in suitable proportions for performing the hybridization 
assay methods in accordance with the invention. Reagent containers preferably contain reagents in 
unit quantities that obviate measuring steps when performing the subject methods. 

25 In another embodiment, the invention relates to methods and compositions for detecting 

and quantifying the level of fee protein of the invention present in a particular biological sample. 
These methods are useful for the diagnosis or prognosis of diseases associated with an altered 
levels of the protein of the invention. Diagnostic assays to detect the protein of the invention may 
comprise a biopsy, in situ assay of cells from organ or tissue sections, or an aspirate of cells from a 

30 tumor or normal tissue. In addition, assays may be conducted upon cellular extracts from organs, 
tissues, cells, urine, or serum or blood or any other body fluid or extract. 

Assays for the quantification of the NBHSD2 polypeptide of SEQ ID NO:l 12 may be 
performed according to methods well known in the art. Typically, these assays comprise 
contacting the sample with a ligand of the protein of the invention or an antibody (polyclonal or 

35 monoclonal) which recognizes the protein of the invention or a fragment thereof, and detecting the 
complex formed between the protein of the invention present in the sample and the ligand or 
antibody. Fragments of the ligands and antibodies may also be used in the binding assays, 
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provided these fragments are capable of specifically interacting with the NBHSD2 of the subject 
invention. Further, the ligands and antibodies which bind to the NBHSD2 of the invention may be 
labeled according to methods known in the art. Labels which are useful in the subject invention 
include, but are not limited to, enzymes labels, radioisotopic labels, paramagnetic labels, and 
5 chemiluminescent labels. Typical techniques are described by Kennedy, J. H., et al (1976) Clin, 
Chim. Acta 70:1-31; and Schurs, A. H. et al (1977) Clin. Chim. Acta 81: 1-40 which disclosure is 
hereby incorporated by-reference in its entirety). 

In another embodiment, the invention relates to compositions and methods using the 
proteins of the invention or fragment thereof to screen for compounds that bind an NBHSD2 

10 polypeptide or fragment thereof. In a preferred embodiment, the proteins of the invention or 

fragment thereof may be used to identify and/or quantify substrates using any techniques known to 
those skilled in the art. To find substrates, the proteins of the invention, or fragment thereof, or 
derivative thereof, may be used for screening libraries of compounds in any of a variety of drug 
screening techniques. The fragment employed in such screening may be free in solution, affixed to 

15 a solid support, borne on a cell surface, or located intracellularly. The formation of binding 
complexes, between the proteins of the invention, or fragment thereof, or derivative thereof, and 
the agent being tested, may be measured by methods well known to those skilled in the art, like, 
but not limited to, the BIAcore (Upsala, Sweden). Antagonists or inhibitors of the proteins of the 
invention may be produced using methods which are generally known in the art, including the 

20 screening of libraries of pharmaceutical agents to identify those which specifically bind the protein 
of the invention. Another technique for drug screening which may be used provides for high 
throughput screening of compounds having suitable binding affinity to the protein of the. invention. 

In another embodiment, the present invention includes the use of NBHSD2 polypeptides, 
or fragments having a desired biological activity to treat or ameliorate a condition in an individual. 

25 For example, the condition may be deficiency of the sex steroid biosynthesis such as hormone- 
dependent disorders, or an abnormality in any of the functions of the sex steroid metabolism. In 
such embodiments, AN NBHSD2 polypeptide, or a fragment thereof, is administered to an 
individual in whom it is desired to increase or decrease any of the activities of NBHSD2 
polypeptides. A NBHSD2 polypeptide or fragment thereof may be administered directly to the 

30 individual or, alternatively, a nucleic acid encoding a NBHSD2 polypeptide or a fragment thereof • 
may be administered to the individual. Alternatively, an agent which increases the activity of 
NBHSD2 polypeptides may be administered to the individual. Such agents may be identified by 
contacting a NBHSD2 polypeptide or a cell or preparation containing NBHSD2 polypeptides with 
a test agent and assaying whether the test agent increases the activity of the protein. For example, 

35 the test agent may be a chemical compound or a polypeptide or peptide. Alternatively, the activity 
of NBHSD2 polypeptides may be decreased by administering an agent which interferes with such 
activity to an individual. Agents which interfere with the activity of NBHSD2 polypeptides may 
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be identified by contacting A NBHSD2 polypeptide or a cell or preparation containing NBHSD2 
polypeptides with a test agent and assaying whether the test agent decreases the activity of the 
protein. . Decreasing the activity of NHBSD2 would be useful for the successful treatment and/or 
management of diseases or biochemical abnormalities associated with decrease of oestradiol. For 
5 example, the agent may be a chemical compound, a polypeptide or peptide, an antibody, or a 
nucleic acid such as an antisense nucleic acid or a triple helix-forming nucleic acid. Another 
embodiment of the invention relates to composition and methods using polynucleotide sequences 
encoding the protein of the invention or fragment thereof to establish transgenic model animals (D. 
melanogaster, M. musculus), by any method familiar to those skilled in the art. By modulating in 

10 vivo the expression of the transgene with drugs or modifier genes (activator or suppressor genes), 
animal models can be developed that mimic human hormone-dependent disorders such as cancers. 
These animal models would thus allow the identification of potential therapeutic agents for 
treatment of the disorders. In addition, recombinant cell lines derived from these transgenic 
animals may be used for similar approaches ex vivo. 

15 In another embodiment, an array of oligonucleotides probes comprising the nucleotide 

sequence of SEQ ID NO: 1 1 1 or fragments thereof can be constructed to conduct efficient 
screening of e.g., genetic mutations or deletion. The microarray can be used to monitor the 
expression level of large numbers of genes simultaneously and to identify genetic variants, 
mutations, and polymorphisms. This information may be used to determine gene function, to 

20 understand the genetic basis of a disorder, to diagnose a disorder, and to develop and monitor the 
activities of therapeutic agents (see for example: Chee, M. et al., Science, 274:610-614 (1996) 
which disclosure is hereby incorporated by reference in its entirety). For example, deletion of 
genes NBHSD2 locus is a frequent target of deletion in human hepatocellular carcinoma. 

25 Uses of antibodies 

Antibodies of the present invention have uses that include, but are not limited to, methods 
known in the art to purify, detect, and target the polypeptides of the present invention including 
both in vitro and in vivo diagnostic and therapeutic methods. An example of such use using 
immunoaffinity chromatography is given below. The antibodies of the present invention may be 

30 used either alone or in combination with other compositions. For example, the antibodies have use 
in immunoassays for qualitatively and quantitatively measuring levels of antigen-bearing 
substances, including the polypeptides of the present invention, in biological samples (See, e.g., 
Harlow et al. 9 1988). (Incorporated by reference in the entirety). The antibodies may also be used in 
therapeutic compositions for killing cells expressing the protein or reducing the levels of the protein in 

35 the body. 

The invention further relates to antibodies that act as agonists or antagonists of the 
polypeptides of the present invention. For example, the present invention includes antibodies that 
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disrupt the receptor/ligand interactions with the polypeptides of the invention either partially or 
fully. Included are both receptor-specific antibodies and ligand-specific antibodies. Included are 
receptor-specific antibodies, which do not prevent ligand binding but prevent receptor activation. 
Receptor activation (i.e., signaling) may be determined by techniques described herein or 
5 otherwise known in the art. Also include are receptor-specific antibodies which both prevent 
ligand binding and receptor activation. Likewise, included are neutralizing antibodies that bind the 
ligand and prevent binding of the ligand to the receptor, as well as antibodies that bind the ligand, 
thereby preventing receptor activation, but do not prevent the ligand from binding the receptor. 
Further included are antibodies that activate the receptor. These antibodies may act as agonists for 

10 either all or less than all of the biological activities affected by ligand-mediated receptor activation. 
The antibodies may be specified as agonists or antagonists for biological activities comprising 
specific activities disclosed herein. The above antibody agonists can be made using methods 
known in the art. See e.g., WO 96/40281; US Patent 5,811,097; Deng et al, (1998) Blood. 
92(6):1981-1988; Chen et al, (1998), Cancer Res. 58(16):3668-3678; Harrop et al., (1998), J. 

IS Immunol. 161(4): 1786-1794; Zhu, et al (1998), Cancer Res. 58(15):3209-3214; Yoon, et al. 

(1998), J. Immunol. 160(7):3 170-3 179; Prat et al, (1998), J. Cell. Sci. 1 1 l(Pt2):237-247; Pitard et 
al, (1997), J. Immunol. Methods. 205(2):177-190; Liautard et al, (1997), Cytokine. 9(4):233-241; 
Carlson et al, (1997;, J. Biol. Chem 272(17):1 1295-1 1301; Taryman, et al, (1995), Neuron. 
14(4):755-762; Muller et al, (1998), Structure. 6(9): 1 153-1167; Bartunek et al, (1996), Cytokine. 

20 8(1): 14-20 (said references incorporated by reference in their entireties). 

As discussed above, antibodies of the polypeptides of the invention can, in turn, be utilized 
to generate anti-idiotypic antibodies that "mimic" polypeptides of the invention using techniques 
well known to those skilled in the art [See, e.g. Greenspan and Bona (1989), FASEB J. 7(5):437- 
444 and Nissinoff, (1991), J. Immunol. 147(8): 2429-2438, which disclosures are hereby 

25 incorporated by reference in their entireties]. For example, antibodies which bind to and 

competitively inhibit polypeptide multimerization or binding of a polypeptide of the invention to 
ligand can be used to generate anti-idiotypes that "mimic" the polypeptide multimerization or 
binding domain and, as a consequence, bind to and neutralize polypeptide or its ligand. Such 
neutralization anti-idiotypic antibodies can be used to bind a polypeptide of the invention or to 

30 bind its ligands/receptors, and thereby block its biological activity. 

Immunoaffinitv Chromatography 

Antibodies prepared as described herein are coupled to a support. Preferably, the 
antibodies are monoclonal antibodies, but polyclonal antibodies may also be used. The support 
35 may be any of those typically employed in immunoaffinity chromatography, including Sepharose 
CL-4B (Pharmacia, Piscataway, NJ), Sepharose CL-2B (Pharmacia, Piscataway, NJ), Affi-gel 10 
(Biorad, Richmond, CA), or glass beads. 
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The antibodies may be coupled to the support using any of the coupling reagents typically 
used in immunoaffinity chromatography, including cyanogen bromide. After coupling the 
antibody to the support, the support is contacted with a sample which contains a target polypeptide 
whose isolation, purification or enrichment is desired. The target polypeptide may be a 

5 polypeptide selected from the group consisting of polypeptide sequences of the Sequence Listing, 
those encoded by the clone inserts of the deposited clone pool, variants and fragments thereof, or a 
fusion protein comprising said selected polypeptide or a fragment thereof. 

Preferably, the sample is placed in contact with the support for a sufficient amount of time 
and under appropriate conditions to allow at least 50% of the target polypeptide to specifically bind 

10 to the antibody coupled to the support. 

Thereafter, the support is washed with an appropriate wash solution to remove 
polypeptides which have non-specifically adhered to the support. The wash solution may be any 
of those typically employed in immunoaffinity chromatography, including PBS, Tris-lithium 
chloride buffer (0.1M lysine base and 0.5M lithium chloride, pH 8.0), Tris-hydrochloride buffer 

15 (0.05M Tris-hydrochloride, pH 8.0), or Tris/Triton/NaCl buffer (50mM Tris.cl, pH 8.0 or 9.0, 
0.1% Triton X-100, and 0.5MNaCl). 

After washing, the specifically bound target polypeptide is eluted from the support using 
the high pH or low pH elution solutions typically employed in immunoaffinity chromatography. In 
particular, the elution solutions may contain an eluant such as triethanolamine, diethylamine, 

20 calcium chloride, sodium thiocyanate, potasssium bromide, acetic acid, or glycine. In some 

embodiments, the elution solution may also contain a detergent such as Triton X-100 or octyl-beta- 
D-glucoside. 

EXPRESSION OF GENSET GENE PRODUCTS 
25 Evaluation of Expression Levels and Patterns of GENSET polypeptide-encoding mRNAs 

The spatial and temporal expression patterns of GENSET polypeptide-encoding mRNAs, 
as well as their expression levels, may be determined as follows. 

Expression levels and patterns of GENSET polypeptide-encoding mRNAs may be 
analyzed by solution hybridization with long probes as described in International Patent 

30 Application No. WO 97/05277, the entire contents of which are hereby incorporated by reference. 
Briefly, a GENSET polynucleotide, or fragment thereof, corresponding to the gene encoding the 
mRNA to be characterized is inserted at a cloning site immediately downstream of a bacteriophage 
(T3, T7 or SP6) RNA polymerase promoter to produce antisense RNA. Preferably, the GENSET 
polynucleotide is at least a 100 nucleotides in length. The plasmid is linearized and transcribed in 

35 the presence of ribonucleotides comprising modified ribonucleotides (i.e. biotin-UTP and DIG- 
UTP). An excess of this doubly labeled RNA is hybridized in solution with mRNA isolated from 
cells or tissues of interest. The hybridizations are performed under standard stringent conditions 
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(40-50°C for 16 hours in an 80% formamide, 0.4 M NaCl buffer, pH 7-8). The unhybridized probe 
is removed by digestion with ribonucleases specific for single-stranded RNA (i.e. RNases CL3, Tl, 
Phy M, U2 or A). The presence of the biotin-UTP modification enables capture of the hybrid on a 
microtitration plate coated with streptavidin. The presence of the DIG modification enables the 
5 hybrid to be detected and quantified by ELISA using an anti-DIG antibody coupled to alkaline 
phosphatase. 

The GENSET polypeptide-encoding cDNAs, or fragments thereof, may also' be tagged 
with nucleotide sequences for the serial analysis of gene expression (SAGE) as disclosed in UK 
Patent Application No. 2 305 241 A, the entire contents of which are incorporated by reference. In 

10 this method, cDNAs are prepared from a cell, tissue, organism or other source of nucleic acid for 
which it is desired to determine gene expression patterns. The resulting cDNAs are separated into 
two pools. The cDNAs in each pool are cleaved with a first restriction endonuclease, called an 
"anchoring enzyme," having a recognition site which is likely to be present at least once in most 
cDNAs. The fragments which contain the 5' or 3' most region of the cleaved cDNA are isolated 

15 by binding to a capture medium such as streptavidin coated beads. A first oligonucleotide linker 
having a first sequence for hybridization of an amplification primer and an internal restriction site 
for a "tagging endonuclease" is ligated to the digested cDNAs in the first pool. Digestion with the 
second endonuclease produces short "tag" fragments from the cDNAs. A second oligonucleotide 
having a second sequence for hybridization of an amplification primer, and an internal restriction 

20 site is ligated to the digested cDNAs in the second pool. The cDNA fragments in the second pool 
are also digested with the "tagging endonuclease" to generate short "tag" fragments derived from 
the cDNAs in the second pool. The "tags" resulting from digestion of the first and second pools 
with the anchoring enzyme and the tagging endonuclease are ligated to one another to produce 
"ditags." In some embodiments, the ditags are concatamerized to produce ligation products 

25 containing from 2 to 200 ditags. The tag sequences are then determined and compared to the 
sequences of the GENSET polypeptide-encoding cDNAs to determine which genes are expressed 
in the cell, tissue, organism, or other source of nucleic acids from which the tags were derived. In 
this way, the expression pattern of a GENSET polypeptide-encoding gene in the cell, tissue, 
organism, or other source of nucleic acids is obtained. 

30 Quantitative analysis of GENSET gene expression may also be performed using arrays. 

For example, quantitative analysis of gene expression may be performed with GENSET 
polynucleotides, or fragments thereof in a complementary DNA microarray as described by Schena 
et al (1995) Science 270:467-470 and Schena et al (1996), Proc Natl Acad Sci U S 
A,.93(20): 10614-10619 which disclosures are hereby incorporated by reference in their entireties. 

35 GENSET polypeptide-encoding cDNAs or fragments thereof are amplified by PGR and arrayed 
from 96-well microtiter plates onto silylated microscope slides using high-speed robotics. Printed ■ 
arrays are incubated in a humid chamber to allow rehydration of the array elements and rinsed, 
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once in 0.2% SDS for 1 min, twice in water for 1 min and once for 5 min in sodium borohydride 
solution. The arrays are submerged in water for 2 min at 95°C, transferred into 0.2% SDS for 1 
min, rinsed twice with water, air dried and stored in the dark at 25°C. Cell or tissue mRNA is 
isolated or commercially obtained and probes are prepared by a single round of reverse 
5 transcription. Probes are hybridized to 1 cm 2 microarrays under a 14 x 14 mm glass coverslip for 
6-12 hours at 60°C. Arrays are washed for 5 min at 25°C in low stringency wash buffer (IX 
SSC/0.2% SDS), then for 10 min at room temperature in high stringency wash buffer (0. IX 
SSC/0.2% SDS). Arrays are scanned in 0.1X SSC using a fluorescence laser scanning device 
fitted with a custom filter set Accurate differential expression measurements are obtained by 

1 0 taking the average of the ratios of two independent hybridizations. 

Quantitative analysis of the expression of genes may also be performed with GENSET 
polypeptide-encoding cDNAs or fragments thereof in complementary DNA arrays as described by 
Pietu et aL, (1996) Genome Research 6:492-503, which disclosure is hereby incorporated by 
reference in its entirety. The GENSET polynucleotides of the invention or fragments thereof are 

15 PCR amplified and spotted on membranes. Then, mRNAs originating from various tissues or cells 
are labeled with radioactive nucleotides. After hybridization and washing in controlled conditions, 
the hybridized mRNAs are detected by phospho-imaging or autoradiography. Duplicate 
experiments are performed and a quantitative analysis of differentially expressed mRNAs is then 
performed. 

20 Alternatively, expression analysis of GENSET genes can be done through high density 

nucleotide arrays as described by Lockhart et aL, (1996) Nature Biotechnology 14: 1675-1680 and 
Sosnowski, et aL, (1997) Proc Natl Acad Sci U S A 94:1 1 19-1 123, which disclosures are hereby 
incorporated by reference in their entireties. Oligonucleotides of 15-50 nucleotides corresponding 
to sequences of a GENSET polynucleotide or fragments thereof are synthesized directly on the 

25 chip (Lockhart et aL, supra) or synthesized and then addressed to the chip (Sosnowski et aL, 

supra). Preferably, the oligonucleotides are about 20 nucleotides in length. cDNA probes labeled 
with an appropriate compound, such as biotin, digoxigenin or fluorescent dye, are synthesized 
from the appropriate mRNA population and then randomly fragmented to an average size of 50 to 
100 nucleotides. The said probes are then hybridized to the chip. After washing as described in 

30 Lockhart et aL, (supra) and application of different electric fields (Sosnowsky et aL, supra), the 
dyes or labeling compounds are detected and quantified. Duplicate hybridizations are performed. 
Comparative analysis of the intensity of the signal originating from cDNA probes on the same 
target oligonucleotide in different cDNA samples indicates a differential expression of the 
GENSET polypeptide-encoding mRNA. 

35 Uses of GENSET gene expression data 

Once the expression levels and patterns of a GENSET polypeptide-encoding mRNA has 
been determined using any technique known to those skilled in the art, in particular those described 
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in the section entitled "Evaluation of Expression Levels and Patterns of GENSET polypeptide- 
encoding mRNAs", or using the instant disclosure, these information may be used to design 
GENSET gene specific markers for detection, identification, screening and diagnosis purposes as 
well as to design DNA constructs with an expression pattern similar to a GENSET gene expression 
5 pattern. 

Detection of GENSET polypeptide expression and/or biological activity 
The invention further relates to methods of detection of GENSET polypeptide expression and/or 
biological activity in a biological sample using the polynucleotide and polypeptide sequences 
10 described herein. Such method scan be used, for example, as a screen fcr normal or abnormal 
GENSET polypeptide expression and/or biological activity and, thus, can be used diagnostically . 
The biological sample for use in the methods of the present invention includes a suitable sample 
: from, for example, a mammal, particularly a human. 
Detection of GENSET polypeptides 
15 The invention further relates to methods of detection of GENSET polypeptide or encoding 

polynucleotides in a sample using the sequences described herein and any techniques known to 
those skilled in the art. For example, a labeled polynucleotide probe having all or a functional 
portion of the nucleotide sequence of a GENSET polypeptide-encoding polynucleotide can be used 
in a method to detect a GENSET polypeptide-encoding polynucleotide in a sample. In one 
20 embodiment, the sample is treated to render the polynucleotides in the sample available for 

hybridization to a polynucleotide probe, which can be DNA or RNA. The resulting treated sample 
is combined with a labeled polynucleotide probe having all or a portion of the nucleotide sequence 
. of the GENSET polypeptide-encoding cDNA or genomic sequence, under conditions appropriate 
for hybridization of complementary sequences to occur. Detection of hybridization of 
. 25 polynucleotides from the sample with the labeled nucleic probe indicates the presence of GENSET 
. polypeptide-encoding polynucleotides in a sample. The presence of GENSET polypeptide- 
encoding mRNA is indicative of GENSET polypeptide-encoding gene expression. 

Consequently, the invention comprises methods for detecting the presence of a 
polynucleotide comprising a nucleotide sequence selected from a group consisting of the 
30 polynucleotide sequences of the Sequence Listing, those of human cDNA clone inserts of the 
deposited clone pool, sequences fully complementary thereto, fragments and variants thereof in a 
sample. In a first embodiment, said method comprises the following steps of: 

a) bringing into contact said sample and a nucleic acid probe or a plurality of nucleic 
acid probes which hybridize to said selected nucleotide sequence; and 
35 b) detecting the hybrid complex formed between said probe or said plurality of 

probes and said polynucleotide. 
In a preferred embodiment of the above detection method, said nucleic acid probe or said 
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plurality of nucleic acid probes is labeled with a detectable molecule. In another preferred 
embodiment of the above detection method, said nucleic acid probe or said plurality of nucleic acid 
probes has been immobilized on a substrate. In still another preferred embodiment, said nucleic 
acid probe or said plurality of nucleic acid probes has a sequence comprised in a sequence 
5 complementary to said selected sequence. 

In a second embodiment, said method comprises the steps of: 

a) contacting said sample with amplification reaction reagents comprising a pair of 
amplification primers located on either side of the region of said nucleotide 
sequence to be amplified; 
1 0 b) performing an amplification reaction to synthesize amplification products 

containing said region of said selected nucleotide sequence; and 

c) detecting said amplification products. 

In a preferred embodiment of the above detection method, when the polynucleotide to be 
amplified is a KNA molecule, preliminary reverse transcription and synthesis of a second cDNA 

15 strand are necessary to provide a DNA template to be amplified. In another preferred embodiment 
of the above detection method, the amplification product is detected by hybridization with a 
labeled probe having a sequence which is complementary to the amplified region. In still another 
preferred embodiment, at least one of said amplification primer has a sequence comprised in said 
selected sequence or in the sequence complementary to said selected sequence. 

20 Alternatively, a method of detecting GENSET polypeptide expression in a test sample can 

be accomplished using any product which binds to a GENSET olypeptide of the present invention 
or a portion of a GENSET polypeptide. Such products may be antibodies, binding fragments of 
antibodies, polypeptides able to bind specifically to GENSET polypeptides or fragments thereof, 
including GENSET polypeptide agonists and antagonists. Detection of specific binding to the 

25 antibody indicates the presence of a GENSET polypeptide in the sample (e.g., ELISA). 

Consequently, the invention is also directed to a method for detecting specifically the 
presence of a GENSET polypeptide according to the invention in a biological sample, said method 
comprising the steps of: 

a) bringing into contact said biological sample with a product able to bind to a 
30 polypeptide of the invention or fragments thereof; 

b) allowing said product to bind to said polypeptide to form a complex; and 

c) detecting said complex. 

In a preferred embodiment of the above detection method, the product is an antibody. In a 
more preferred embodiment, said antibody is labeled with a detectable molecule. In another more 
35 preferred embodiment of the above detection method, said antibody has been immobilized on a 
substrate. 

Li addition, the invention also relates to methods of determining whether a GENSET gene product 
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(e.g. a polynucleotide or polypeptide) is present or absent in a biological sample, said methods 
comprising the steps of: 

a) obtaining said biological sample from a human or non-human animal, 
preferably a mammal; 

5 b) contacting said biological sample wife a product able to bind to a GENSET 

polypeptide or encoding polynucleotide of the invention; and 
c) determining the presence or absence of said GENSET polypeptide-encoding 
gene product in said biological sample. 
The present invention also relates to kits that can be used in the detection of GENSET 
10 polypeptide-encoding gene expression products. The kit can comprise a compound that 

specifically binds a GENSET polypeptide (e.g. binding proteins, antibodies or binding fragments 
thereof (e.g. F(ab02 fragments) or a GENSET polypeptide-encoding mRNA (e.g. a complementary 
probe or primer), for example, disposed within a container means. The kit can further comprise 
ancillary reagents, including buffers and the like. 
15 Detection of GENSET polypeptide biological activity 

The invention further includes methods of detecting specifically a GENSET polypeptide 
biological activity, and to identify compounds capable of modulating the activity of a GENSET 
polypeptide. Assessing the GENSET polypeptide biological activity may be performed by the 
detection of a change in any cellular property associated with the GENSET polypeptide, using a 
20 variety of techniques, including those described herein. To identify modulators of the 

polypeptides, a control is preferably used. For example, a control sample includes all of the same 
, reagents but lacks the compound or agent being assessed; it is treated in the same manner as the 
test sample. 

The present invention also relates to kits that can be used in the detection of GENSET 
25 polypeptide biological activity. The kit can comprise, e.g. substrates for GENSET polypeptides, 
GENSET-binding compounds, antibodies to GENSET polypeptides, etc., for example, disposed 
within a container means. The kit can further comprise ancillary reagents, including buffers and the 
like. 

30 Identification of a specific context of GENSET polypeptide-encoding gene expression 

When the expression pattern of a GENSET polypeptide-encoding mRNA shows that a 
GENSET polypeptide-encoding gene is specifically expressed in a given context, probes and 
primers specific for this gene as well as antibodies binding to the GENSET polypeptide-encoding 
polynucleotide may then be used as markers for the specific context. Examples of specific 

35 contexts are: specific expression in a given tissue/cell or tissue/cell type, expression at a given 
stage of development of a process such as embryo development or disease development, or specific 
expression in a given organelle. Such primers, probes, and antibodies are useful commercially to 
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identify tissues/cells/organelles of unknown origin, for example, forensic samples, differentiated 
tumor tissue that has metastasized to foreign bodily sites, or to differentiate different tissue types in 
a tissue cross-section using any technique known to those skilled in the art including in situ PCR or 
immunochemistry for example. 
5 For example, the cDNAs and proteins of the sequence listing and fragments thereof, may 

be used to distinguish human tissues/cells from non-human tissues/cells and to distinguish between 
human tissues/cells/organelles that do and do not express the polynucleotides comprising the 
cDNAs. By knowing the expression pattern of a given GENSET polypeptide, either through 
routine experimentation or by using the instant disclosure, the polynucleotides and polypeptides of 

1 0 the present invention may be used in methods of determining the identity of an unknown tissue/cell 
sample/organelle. As part of determining the identity of an unknown tissue/cell sample/organelle, 
the polynucleotides and polypeptides of the present invention may be used to determine what the 
unknown tissue/cell sample is and what the unknown sample is not. For example, if a cDNA is 
expressed in a particular tissue/cell type/organelle, and the unknown tissue/cell sample/organelle 

15 does not express the cDNA, it may be inferred that the unknown tissue/cells are either not human 
or not the same human tissue/cell type/organelle as that which expresses the cDNA. Determination 
of tissue/cell/organelle identity is based on methods that detect the presence or absence of the 
mRNA (or corresponding cDNA) in a tissue/cell sample using methods well known in the art (e.g., 
hybridization, PCR based methods, immunoassays, immunochemistry, ELISA). Examples of such 

20 techniques are described in more detail below. Therefore, the invention encompasses uses of the 
polynucleotides and polypeptides of the invention as tissue markers. Consequently, the present 
invention encompasses methods of identification of a tissue/cell type/subcellular compartment, 
wherein said method includes the steps of: 

a) contacting a biological sample which identity is to be assayed with a product able 
25 . to bind a GENSET gene product; and 

b) determining whether a GENSET gene product is expressed in said biological 
sample. 

Products that are able to bind specifically to a GENSET gene product, namely a GENSET 
polypeptide or a GENSET polypeptide-encoding mRNA, include GENSET polypeptide binding 
30 proteins, antibodies or binding fragments thereof (e.g. F(ab , )2 fragments), as well as GENSET 
polynucleotide complementary probes and primers. 

Step b) may be performed using any detection method known to those skilled in the art 
including those disclosed herein, especially in the section entitled 'Detection of GENSET 
polypeptide expression and/or biological activity". 

35 

Identification of Tissue Types or Cell Species by Means of Labeled Tissue Specific Antibodies 
Identification of specific tissues is accomplished by the visualization of tissue specific 
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antigens by means of antibody preparations which are conjugated, directly (e.g., green fluorescent 
protein) or indirectly to a detectable marker. Selected labeled antibody species bind to their 
specific antigen binding partner in tissue sections, cell suspensions, or in extracts of soluble 
proteins from a tissue sample to provide a pattern for qualitative or semi-qualitative interpretation. 

5 

A. Immunohistochemical Techniques 

Purified, high-titer antibodies, prepared as described above, are conjugated to a detectable 
marker, as described, for example, by Fudenberg, (1980) Chap. 26 in: Basic 503 Clinical 
Immunology, 3rd Ed. Lange, Los Altos, California or Rose et ai 9 (1980) Chap. 12 in: Methods in 

10 Immunodiagnosis, 2d Ed. John Wiley 503 Sons, New York, which disclosures are hereby 
incorporated by reference in their entireties. 

A fluorescent marker, either fluorescein or rhodamine, is preferred, but antibodies can also 
be labeled with an enzyme that supports a color producing reaction with a substrate, such as 
horseradish peroxidase. Markers can be added to tissue-bound antibody in a second step, as 

15 described below. Alternatively, the specific anti-tissue antibodies can be labeled with ferritin or 
other .electron dense particles, and localization of the ferritin coupled antigen-antibody complexes 
achieved by means of an electron microscope. In yet another approach, the antibodies are 
radiolabeled, with, for example 125 I, and detected by overlaying the antibody treated preparation 
with photographic emulsion. Preparations to carry out the procedures can comprise monoclonal or 

20 polyclonal antibodies to a single protein or peptide identified as specific to a tissue type, for 
example, brain tissue, or antibody preparations to several antigenically distinct tissue specific 
antigens can be used in panels, independently or in mixtures, as required. Tissue sections and cell 
suspensions are prepared for immunohistochemical examination according to common histological 
techniques. Multiple cryostat sections (about 4 urn, unfixed) of the unknown tissue and known 

25 control, are mounted and each slide covered with different dilutions of the antibody preparation. 
Sections of known and unknown tissues should also be treated with preparations to provide a 
positive control, a negative control, for example, pre-immune sera, and a control for non-specific 
staining, for example, buffer. Treated sections are incubated in a humid chamber for 30 min at 
room temperature, rinsed, then washed in buffer for 30-45 min. Excess fluid is blotted away, and 

30 the marker developed. If the tissue specific antibody was not labeled in the first incubation, it can 
be labeled at this time in a second antibody-antibody reaction, for example, by adding fluorescein- 
or enzyme-conjugated antibody against the immunoglobulin class of the antiserum-producing 
species, for example, fluorescein labeled antibody to mouse IgG. Such labeled sera are 
commercially available. The antigen found in the tissues by the above procedure can be quantified 

35 by measuring the intensity of color or fluorescence on the tissue section, and calibrating that signal 
using appropriate standards. 
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B. Identification of Tissue Specific Soluble Proteins 

The visualization of tissue specific proteins and identification of unknown tissues from that 
procedure is carried out using the labeled antibody reagents and detection strategy as described for 
immunohistochemistry; however the sample is prepared according to an electrophoretic technique 
5 to distribute the proteins extracted from the tissue in an orderly array on the basis of molecular 
weight for detection. For example, Western Blot Analysis, see, e.g., Davis et al. y Basic Methods in 
Molecular Biology, ed., Elsevier Press, NY (1986), Section 19-3. 

In either procedure A or B, a detectable label can be attached to the primary tissue antigen- 
primary antibody complex according to various strategies and permutations thereof. In a 

10 straightforward approach, the primary specific antibody can be labeled; alternatively, the unlabeled 
complex can be bound by a labeled secondary anti-IgG antibody. In other approaches, either the 
primary or secondary antibody is conjugated to a biotin molecule, which can, in a subsequent step, 
bind an avidin conjugated marker. According to yet another strategy, en2yme labeled or 
radioactive protein A, which has the property of binding to any IgG, is bound in a final step to 

15 either the primary or secondary antibody. The visualization of tissue specific antigen binding at 
levels above those seen in control tissues to one or more tissue specific antibodies, prepared from 
the gene sequences identified from cDNA sequences, can identify tissues of unknown origin, for 
example, forensic samples, or differentiated tumor tissue that has metastasized to foreign bodily 
sites. 

20 

Screening and diagnosis of abnormal GENSET polypeptide expression and/or biological activity 

Moreover, antibodies and/or primers specific for GENSET polypeptide expression may 
also be used to identify abnormal GENSET polypeptide expression and/or biological activity, and 
. subsequently to screen and/or diagnose disorders associated with abnormal GENSET polypeptide 

25 expression. For example, a particular disease may result from lack of expression, over expression, 
or under expression of a GENSET polypeptide-encoding mRNA. By comparing mRNA 
expression patterns and quantities in samples taken from healthy individuals with those from 
individuals suffering from a particular disorder, genes responsible for this disorder may be 
identified. Primers, probes and antibodies specific for this GENSET polypeptide may then be used 

30 to elaborate kits of screening and diagnosis for a disorder in which the gene of interest is 

specifically expressed or in which its expression is specifically dysregulated, i.e. underexpressed or 
overexpressed. 

Screening for specific disorders 
35 The present invention also relates to methods and uses of GENSET polypeptides for 

identifying individuals having elevated or reduced levels of GENSET polypeptides, which 
individuals are likely to benefit from therapies to suppress or enhance GENSET polypeptide- 
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encoding gene expression, respectively. One example of such methods and uses comprises the 
steps of: 

a) obtaining from a mammal a biological sample; 

b) detecting the presence in said sample of a GENSET polypeptide-encoding gene 
5 product (mRNA or protein); 

c) comparing the amount of said GENSET polypeptide-encoding gene product 
present in said sample with that of a control sample; and 

d) determining whether said human or non-human mammal has a reduced or elevated 
level of GENSET gene expression compared to the control sample. 

10 A biological sample from a subject affected by, or at risk of developing, any disease or 

condition associated with a GENSET polypeptide can be screened for the presence of increased or 
decreased levels of GENSET gene product, relative to a normal population (standard or control), 
with an increased or decreased level of the GENSET polypeptide relative to the normal population 
being indicative of predisposition to or a present indication of the disease or condition, or any 

15 sympton associated with the disease or condition. Such individuals would be candidates for 

therapies, e.g., treatment with pharmaceutical compositions comprising the GENSET polypeptide, 
a polynucleotide encoding the GENSET polypeptide, or any other compound that affects the 
expression or activity of the GENSET polypeptide. Generally, the identification of elevated levels 
of the GENSET polypeptide in a patient would be indicative of an individual that would benefit 

20 from treatment with agents that suppress GENSET polypeptide expression or activity, and the 
identification of low levels of the GENSET polypeptide in a patient would be indicative of an 
individual that would benefit from agents that induce GENSET expression or activity. 

Biological samples suitable for use in this method include any biological fluids, including, 
but not limited to, blood, saliva, milk, and urine. Tissue samples (e.g. biopsies) can also be used in 

25 the method of the invention, including samples derived from any tissue associated with GENSET 
gene expression as determined by any method common to the art such as those described herein.. 
Cell cultures or cell extracts derived, for example, from tissue biopsies can also be used. The 
detection step of the present method can be performed using standard protocols for protein/mRNA 
detection. Examples of suitable protocols include Northern blot analysis, immunoassays (e.g. RIA, 

30 Western blots, immimohistochemical analyses), and PCR. 

Thus, the present invention further relates to methods and uses of GENSET polypeptides 
for identifying individuals or non-human animals at increased risk for developing, or present state 
of having, certain diseases/disorders associated with abnormal GENSET polypeptide expression or 
biological activity. One example of such methods comprises the steps of: 

35 a) obtaining from a human or non-human mammal a biological sample; 

b) detecting the presence in said sample of a GENSET gene product (mRNA 
or protein); 
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c) comparing the amount of said GENSET gene product present in said 
sample with that of a control sample; and 

d) determing whether said human or non-human mammal is at increased risk 
for developing, or present state of having, a diseases or disorder. 

5 In preferred embodiments, the biological sample is taken from animals presenting any 

symptom associated with any disease or condition associated with a GENSET gene product. In 
accordance with this method, the presence in the sample of altered (e.g. increased or decreased) 
levels of the GENSET product indicates that the subject is predisposed to the disease or condition. 
Biological samples suitable for use in this method include biological fluids including, but not 
10 limited to, blood, saliva, milk, and urine. Tissue samples (e.g. biopsies) can also be used in the 
method of the invention. Cell cultures or cell extracts derived, for example, from tissue biopsies 
can also be used. 

The diagnostic methodologies described herein are applicable to both humans and 
non-human mammals. 

15 

Detection of GENSET gene mutations 

The invention also encompasses methods and uses of GENSET polynucleotides to detect 
mutations in GENSET polynucleotides of the invention. Such methods may advantageously be 
used to detect mutations occurring in GENSET genes and preferably in their regulatory regions. 

20 When the mutation was proven to be associated with a disease, the detection of such mutations 
may be used for screening and diagnosis purposes. 

In one embodiment of the oligonucleotide arrays of the invention, an oligonucleotide probe 
matrix may advantageously be used to detect mutations occurring in GENSET genes and 
preferably in their regulatory regions. For this particular purpose, probes are specifically designed 

25 to have a nucleotide sequence allowing their hybridization to the genes that carry known mutations 
(either by deletion, insertion or substitution of one or several nucleotides). By known mutations, it 
is meant, mutations on the GENSET genes that have been identified according, for example to the 
technique used by Huang et al 9 (1996) Cancer Res 56(5): 1137-1141 or Samson et aL, (1996) 
Nature, 382(6593):722-725, which disclosures are hereby incorporated by reference in their 

30 entireties. 

Another technique that is used to detect mutations in GENSET genes is the use of a high- 
density DNA array. Each oligonucleotide probe constituting a unit element of the high density 
DNA array is designed to match a specific subsequence of a GENSET genomic DNA or cDNA. 
Thus, an array consisting of oligonucleotides complementary to subsequences of the target gene 
35 sequence is used to determine the identity of the target sequence with the wild gene sequence, 
measure its amount, and detect differences between the target sequence and the reference wild 
gene sequence of the GENSET gene. In one such design, termed 4L tiled array, is implemented a 
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set of four probes (A, C, G, T), preferably 15-rmcleotide oligomers. In each set of four probes, the 
perfect complement will hybridize more strongly than mismatched probes. Consequently, a 
nucleic acid target of length L is scanned for mutations with a tiled array containing 4L probes, the 
whole probe set containing all the possible mutations in the known wild reference sequence. The 
5 hybridization signals of the 15-mer probe set tiled array are perturbed by a single base change in 
the target sequence. As a consequence, there is a characteristic loss of signal or a "footprint" for 
the probes flanking a mutation position. This technique was described by Chee et al, (1996) 
Science. 274:610-614, which disclosure is hereby incorporated by reference in its entirety. 

10 Construction of DNA constructs with a GENSET gene expression pattern 

In addition, characterization of the spatial and temporal expression patterns and expression 
levels of GENSET polypeptide-encoding mRNAs is also useful for constructing expression vectors 
capable of producing a desired level of gene product in a desired spatial or temporal manner, as 
discussed below. 

15 

DNA Constructs That Direct Temporal And Spatial GENSET Gene Expression In Recombinant 
Cell Hosts And In Transgenic Animals. 

In order to study the physiological and phenotypic consequences of a lack of synthesis of a 
GENSET polypeptide, both at the cellular level and at the multi cellular organism level, the 

20 invention also encompasses DNA constructs and recombinant vectors enabling a conditional 
expression of a specific allele of a GENSET polypeptide-encoding genomic sequence or cDNA 
and also of a copy of this genomic sequence or cDNA harboring substitutions, deletions, or 
additions of one or more bases as regards to a polynucleotide of the present regulatory sequence, 
but preferably in the S'-regulatory sequence or in an exon of the GENSET polypeptide-encoding 

25 genomic sequence or within the GENSET polypeptide-encoding cDNA. 

A first preferred DNA construct is based on the tetracycline resistance operon let from E. 
coli transposon TnlO for controlling the GENSET gene expression, such as described by Gossen et 
al, (1992) Proc. Natl. Acad. Sci. USA. 89:5547-5551; Gossen et a/., (1995) Science 268:1766- 
1769; and Furth P.A. et al (1994) Proc. Natl. Acad. Sci USA. 91 :9302-9306, which disclosures are 

30 hereby incorporated by reference in their entireties. Such a DNA construct contains seven tet 

operator sequences from TnlO (te/op) that are fused to either a minimal promoter or a 5 5 -regulatory 
sequence of the GENSET gene, said minimal promoter or said GENSET polynucleotide regulatory 
sequence being operably linked to a polynucleotide of interest that codes either for a sense or an 
antisense oligonucleotide or for a polypeptide, including a GENSET polypeptide, or a peptide 

35 fragment thereof. This DNA construct is functional as a conditional expression system for the 
nucleotide sequence of interest when the same cell also comprises a nucleotide sequence coding 
for either the wild type (tTA) or the mutant (rTA) repressor fused to the activating domain of viral 
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protein VP 16 of herpes simplex virus, placed under the control of a promoter, such as the 
HCMVBE1 enhancer/promoter or the MMTV-LTR. Indeed, a preferred DNA construct of the 
invention comprise both the polynucleotide containing the tet operator sequences and the 
polynucleotide containing a sequence coding for the tTA or the rTA repressor. In a specific 
5 embodiment, the conditional expression DNA construct contains the sequence encoding the mutant 
tetracycline repressor rTA, the expression of the polynucleotide of interest is silent in the absence 
of tetracycline and induced in its presence. 

DNA Constructs Allowing Homologous Recombination: Replacement Vectors 

10 A second preferred DNA construct will comprise, from 5'-end to 3 '-end: (a) a first 

nucleotide sequence that is found in the GENSET polypeptide-encoding genomic sequence; (b) a 
nucleotide sequence comprising a positive selection marker, such as the marker for neomycin 
resistance (neo); and (c) a second nucleotide sequence that is found in the GENSET polypeptide- ■ 
encoding genomic sequence, and is located on the genome downstream the first GENSET 

15 polypeptide-encoding nucleotide sequence (a). 

In a preferred embodiment, this DNA construct also comprises a negative selection marker 
located upstream of the nucleotide sequence (a) or downstream from the nucleotide sequence (c). 
Preferably, the negative selection marker comprises the thymidine kinase (tk) gene [Thomas et al 
(1986), Cell. 44:419-428], the hygromycine beta gene [Te Riele et al (1990), Nature. 348:649- 

20 651], the hprt gene [Van der Lugt et al (1991), Gene. 105:263-267; Reid et al, (1990) Proc. Natl. 
Acad. Sci. U.S.A. 87:4299-4303] or the Diphteria toxin A fragment (Dt-A) gene [Nada et al 9 
(1993) Cell 73:1 125-1135; Yagi, T., et al (1990), Proc. Natl. Acad. Sci. U.S.A. 87:9918-992], 
- which disclosures are hereby incorporated by reference in their entireties. Preferably, the positive 
selection marker is located within a GENSET exon sequence so as to interrupt the sequence 

25 encoding a GENSET polypeptide. These replacement vectors are described, for example, by 
Thomas et a/.(1986; 1987), Mansour et a/.(1988) and Koller et al, (1992) Annu. Rev. Immunol. 
10:705-730. 

The first and second nucleotide sequences (a) and (c) may be indifferently located within a 
GENSET polypeptide-encoding regulatory sequence, an intronic sequence, an exon sequence or a 
30 sequence containing both regulatory and/or intronic and/or exon sequences. The size of the 
nucleotide sequences (a) and (c) ranges from 1 to 50 kb, preferably from 1 to 10 kb, more 
preferably from 2 to 6 kb and most preferably from 2 to 4 kb. 

DNA Constructs Allowing Homologous Recombination: Cre~LoxP System. 
35 These new DNA constructs make use of the site specific recombination system of the PI 

phage. The PI phage possesses a recombinase called Cre which interacts specifically with a 34 
base pairs loxP site. The loxP site is composed of two palindromic sequences of 13 bp separated 
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by a 8 bp conserved sequence [Hoess et al 9 (1986) Nucleic Acids Res. 14:2287-2300], which 
disclosure is hereby incorporated by reference in its entirety. The recombination by the Cre 
en2yme between two loxP sites having an identical orientation leads to the deletion of the DNA 
fragment. 

5 The Cre-tocP system used in combination with a homologous recombination technique has 

been first described by Gu H. et al 9 (1993) Cell 73:1155-1 164 and Gu H. et aL, (1994) Science 
265 : 103-106, which disclosures are hereby incorporated by reference in their entireties. Briefly, a 
nucleotide sequence of interest to be inserted in a targeted location of the genome harbors at least 
two loxP sites in the same orientation and located at the respective ends of a nucleotide sequence to 

10 be excised from the recombinant genome. The excision event requires the presence of the 
recombinase (Cre) enzyme within the nucleus of the recombinant cell host. The recombinase 
enzyme may be brought at the desired time either by (a) incubating the recombinant cell hosts in a 
culture medium containing this enzyme, by injecting the Cre enzyme directly into the desired cell, 
such as described by Araki et aL 9 (1995) Proc. Natl. Acad. Sci. USA. 92(1): 160-4, which 

1 5 disclosure is hereby incorporated by reference in its entirety, or by lipofection of the enzyme into 
the cells, such as described by Baubonis et al (1993) Nucleic Acids Res. 21(9):2025-9), which 
disclosure is hereby incorporated by reference in its entirety; (b) transfecting the cell host with a 
vector comprising the Cre coding sequence operably linked to a promoter functional in the 
recombinant cell host, which promoter being optionally inducible, said vector being introduced in 

20 the recombinant cell host, such as described by Gu et al. (1993) and Sauer et al., (1988) Proc. Natl. 
Acad. Sci. U.S.A. 85:5166-5170, which disclosures are hereby incorporated by reference in their 
entireties; (c) introducing in the genome of the cell host a polynucleotide comprising the Cre 
coding sequence operably linked to a promoter functional in the recombinant cell host, which 
promoter is optionally inducible, and said polynucleotide, being inserted in the genome of the cell 

25 host either by a random insertion event or an homologous recombination event, such as described 
byGuefa/.(1994). 

In a specific embodiment, the vector containing the sequence to be inserted in the 
GENSET gene by homologous recombination is constructed in such a way that selectable markers 
are flanked by loxP sites of the same orientation, it is possible, by treatment by the Cre enzyme, to 

30 eliminate the selectable markers while leaving the GENSET sequences of interest that have been 
inserted by an homologous recombination event. Again, two selectable markers are needed: a 
positive selection marker to select for the recombination event and a negative selection marker to 
select for the homologous recombination event. Vectors and methods using the Cre-lox? system 
are described by Zou, et al, (1994) Curr. Biol. 4:1099-1 103, which disclosure is hereby 

35 incorporated by reference in its entirety. 

Thus, a third preferred DNA construct of the invention comprises, from 5'-end to 3'-end: 
(a) a first nucleotide sequence that is comprised in the GENSET genomic sequence; (b) a 
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nucleotide sequence comprising a polynucleotide encoding a positive selection marker, said 
nucleotide sequence comprising additionally two sequences defining a site recognized by a 
recombinase, such as a loxP site, the two sites being placed in the same orientation; and (c) a 
second nucleotide sequence that is comprised in the GENSET genomic sequence, and is located on 
5 the genome downstream of the first GENSET nucleotide sequence (a). 

The sequences defining a site recognized by a recombinase, such as a lox? site, are 
preferably located within the nucleotide sequence (b) at suitable locations bordering the nucleotide 
sequence for which the conditional excision is sought. In one specific embodiment, two loxP sites 
are located at each side of the positive selection marker sequence, in order to allow its excision at a 

1 0 desired time after the occurrence of the homologous recombination event. 

In a preferred embodiment of a method using the third DNA construct described above, the 
excision of the polynucleotide fragment bordered by the two sites recognized by a recombinase, 
preferably two loxP sites, is performed at a desired time, due to the presence within the genome of 
the recombinant host cell of a sequence encoding the Cre enzyme operably linked to a promoter 

15 sequence, preferably an inducible promoter, more preferably a tissue-specific promoter sequence 
and most preferably a promoter sequence which is both inducible and tissue-specific, such as 
described by Gu et a/.(1994). 

The presence of the Cre enzyme within the genome of the recombinant cell host may result 
from the breeding of two transgenic animals, the first transgenic animal bearing the GENSET- 

20 derived sequence of interest containing the lox? sites as described above and the second transgenic 
animal bearing the Cre coding sequence operably linked to a suitable promoter sequence, such as 
described by Gu et a/.(1994). 

Spatio-temporal control of the Cre enzyme expression may also be achieved with an adenovirus 
based vector that contains the Cre gene thus allowing infection of cells, or in vivo infection of 

25 organs, for delivery of the Cre enzyme, such as described by Anton and Graham, (1995), J. Virol., 
69: 4600-4606 and Kanegae et al, (1995) Nucl. Acids Res. 23:3816-3821, which disclosures are 
hereby incorporated by reference in their entireties. 

The DNA constructs described above may be used to introduce a desired nucleotide 
sequence of the invention, preferably a GENSET genomic sequence or a GENSET cDNA 

30 sequence, and most preferably an altered copy of a GENSET genomic or cDNA sequence, within a 
predetermined location of the targeted genome, leading either to the generation of an altered copy 
of a targeted gene (knock-out homologous recombination) or to the replacement of a copy of the 
targeted gene by another copy sufficiently homologous to allow an homologous recombination 
event to occur (knock-in homologous recombination). 

35 

Modifying genset polypoptide expression and/or biological activity 
Modifying endogenous GENSET expression and/or biological activity is expressly 
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contemplated by the present invention. 

Screening for compounds that modulate GENSET expression and/or biological activity 

The present invention further relates to compounds able to modulate GENSET expression 
and/or biological activity and methods to use these compounds. Such compounds may interact 
5 with the regulatory sequences of GENSET genes or they may interact with GENSET polypeptides 
directly or indirectly. 

Compounds Interacting With GENSET Regulatory Sequences 

The present invention also concerns a method for screening substances or molecules that 
are able to interact with the regulatory sequences of a GENSET gene, such as for example 

10 promoter or enhancer sequences in untranscribed regions of the genomic DNA, as determined 
using any techniques known to those skilled in the art including those described in the section 
entitled "Identification of Promoters in Cloned Upstream Sequences, or such as regulatory 
sequences located in untranslated regions of GENSET mRNA. 

Sequences within untranscribed or untranslated regions of polynucleotides of the invention 

15 may be identified by comparison to databases containing known regulatory sequence such as 

transcription start sites, transcription factor binding sites, promoter sequences, enhancer sequences, 
5'UTR and 3'UTR elements (Pesole et a!., (2000) Nucleic Acids Res, 28(1): 193-196; http://igs- 
senrer.airs-mrs.jQ:/^gauthere/UTR/index.html]. Alternatively, the regulatory sequences of interest 
may be identified through conventional mutagenesis or deletion analyses of reporter plasmids 

20 using, for instance, techniques described in the section entitled "Identification of Promoters in 
Cloned Upstream Sequences". 

Following the identification of potential GENSET regulatory sequences, proteins which 
interact with these regulatory sequences may be identified as described below. 

Gel retardation assays may be performed independently in order to screen candidate 

25 molecules that are able to interact with the regulatory sequences of the GENSET gene, such as 
described by Fried and Crothers, (1981) Nucleic Acids Res. 9:6505-6525, Garner and Revzin, 
(1981) Nucleic Acids Res 9:3047-3060 and Dent and Latchman (1993) The DNA mobility shift 
assay. In: Transcription Factors: A Practical Approach (Latchman DS, ed.) ppl-26. Oxford: IRL 
Press, the teachings of these publications being herein incorporated by reference. These techniques 

30 are based on the principle according to which a DNA or mRNA fragment which is bound to a 
protein migrates slower than the same unbound DNA or mRNA fragment. Briefly, the target 
nucleotide sequence is labeled. Then the labeled target nucleotide sequence is brought into contact 
with either a total nuclear extract from cells containing regulation factors, or with different 
candidate molecules to be tested. The interaction between the target regulatory sequence of the 

35 GENSET gene and the candidate molecule or the regulation factor is detected after gel or capillary 
electrophoresis through a retardation in the migration. 

Nucleic acids encoding proteins which are able to interact with the promoter sequence of 
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the GENSET gene, more particularly the polynucleotides of the 5' and 3' regulatory region or a 
fragment or variant thereof, may be identified by using a one-hybrid system, such as that described 
in the booklet enclosed in the Matchmaker One-Hybrid System kit from Clontech (Catalog Ref. 
no. Kl 603-1, the technical teachings of which are herein incorporated by reference). 

5 

Lieands interacting with GENSET polypeptides 

For the purpose of the present invention, a ligand means a molecule, such as a protein, a 
peptide, an antibody or any synthetic chemical compound capable of binding to a GENSET protein 
or one of its fragments or variants or to modulate the expression of the polynucleotide coding for 
10 GENSET or a fragment or variant thereof. 

In the ligand screening method according to the present invention, a biological sample or a 
defined molecule to be tested as a putative ligand of a GENSET protein is brought into contact 
with the corresponding purified GENSET protein, for example the corresponding purified 
recombinant GENSET protein produced by a recombinant cell host as described herein, in order to 
1 5 form a complex between this protein and the putative ligand molecule to be tested. 

As an illustrative example, to study the interaction of a polypeptide of the invention, with 
drugs or small molecules, such as molecules generated through combinatorial chemistry 
approaches, the microdialysis coupled to HPLC method described by Wang, et al (1997), 
Chromatographic 44 : 205-208 or the affinity capillary electrophoresis method described by Bush 
20 et a!., (1997), J. Chromatogr., 777 : 3 1 1-328, the disclosures of which are incoiporated by 
reference, can be used. 

In further methods, peptides, drugs, fatty acids, lipoproteins, or small molecules which 
interact with a may be identified using assays known in the art. For example, the molecule to be 
tested for binding is labeled with a detectable label, such as a fluorescent, radioactive, or enzymatic 
25 tag and placed in contact with immobilized GENSET protein, or a fragment thereof under 
conditions which permit specific binding to occur. After removal of non-specifically bound 
molecules, bound molecules are detected using appropriate means. 

Various candidate substances or molecules can be assayed for interaction with a GENSET 
polypeptide. These substances or molecules include, without being limited to, natural or synthetic 
30 organic compounds or molecules of biological origin such as polypeptides. When the candidate 
substance or molecule comprises a polypeptide, this polypeptide may be the resulting expression 
product of a phage clone belonging to a phage-based random peptide library, or alternatively the 
polypeptide may be the resulting expression product of a cDNA library cloned in a vector suitable 
for performing a two-hybrid screening assay. 

35 

A. Candid ate lipands obtained from random peptide libraries 

In a particular embodiment of the screening method, the putative ligand is the expression 
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product of a DNA insert contained in a phage vector [Parmley and Smith, (1988) Gene 73:305- 
318]. Specifically, random peptide phages libraries are used. The random DNA inserts encode for 
peptides of 8 to 20 amino acids in length [Oldenburg et al, (1992), Proc. Natl. Acad. Sci. USA 
89:5393-5397; Valadon et aL, (1996), J. Mol. Biol., 261:11-22; Lucas (1994), In: Development 
5 and Clinical Uses of Haempophilus b Conjugate; Westerink (1995), Proc. Natl. Acad. Sci USA., 
92:4021-4025; Felici, (1991), J. Mol. Biol., 222:301-310], which disclosures are hereby 
incorporated by reference in their entireties. According to this particular embodiment, the 
recombinant phages expressing a protein that binds to an immobilized GENSET protein is retained 
and the complex formed between the GENSET protein and the recombinant phage may be 
10 subsequently immunoprecipitated by a polyclonal or a monoclonal antibody directed against the 
GENSET protein. 

Once the ligand library in recombinant phages has been constructed, the phage population 
is brought into contact with the immobilized GENSET protein. Then the preparation of complexes 
is washed in order to remove the non-specifically bound recombinant phages. The phages that 

15 bind specifically to the GENSET protein are then eluted by a buffer (acid pH) or 

immunoprecipitated by the monoclonal antibody produced by the hybridoma anti-GENSET, and 
this phage population is subsequently amplified by an over-infection of bacteria (for example E. 
coli). The selection step may be repeated several times, preferably 2-4 times, in order to select the 
more specific recombinant phage clones. The last step comprises characterizing the peptide 

20 produced by the selected recombinant phage clones either by expression in infected bacteria and 
isolation, expressing the phage insert in another host-vector system, or sequencing the insert 
contained in the selected recombinant phages. 

B. Candidate lizands obtained by competition experiments. 

25 Alternatively, peptides, drugs or small molecules which bind to polypeptide of the present 

invention may be identified in competition experiments. In such assays, the GENSET protein, or a 
fragment thereof, is immobilized to a surface, such as a plastic plate. Increasing amounts of the 
peptides, drugs or small molecules are placed in contact with the immobilized GENSET protein, or 
a fragment thereof, in the presence of a detectable labeled known GENSET protein ligand. For 

30 example, the GENSET ligand may be detectably labeled with a fluorescent, radioactive, or 

enzymatic tag. The ability of the test molecule to bind the GENSET protein, or a fragment thereof, 
is determined by measuring the amount of detectably labeled known ligand bound in the presence 
of the test molecule. A decrease in the amount of known ligand bound to the GENSET protein, or 
a fragment thereof, when the test molecule is present indicated that the test molecule is able to bind 

35 to the GENSET protein, or a fragment thereof. 

C. Candidate lizands obtained bv affinity chromatography. 
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Proteins or other molecules interacting with a polypeptide of the present invention, can 
also be found using affinity columns which contain the GENSET protein, or a fragment thereof 
The GENSET protein, or a fragment thereof, may be attached to the column using conventional 
techniques including chemical coupling to a suitable column matrix such as agarose, Affi Gel® , or 
5 other matrices familiar to those of skill in art. In some embodiments of this method, the affinity 
column contains chimeric proteins in which the GENSET protein, or a fragment thereof, is fused to 
glutathion S transferase (GST). A mixture of cellular proteins or pool of expressed proteins as 
described above is applied to the affinity column. Proteins or other molecules interacting with the 
GENSET protein, or a fragment thereof, attached to the column can then be isolated and analyzed 
10 on 2-D electrophoresis gel as described in Ramunsen et ai, (1997), Electrophoresis, 18 : 588-598, 
the disclosure of which is incorporated by reference. Alternatively, the proteins retained on the 
affinity column can be purified by electrophoresis based methods and sequenced. The same 
method can be used to isolate antibodies, to screen phage display products, or to screen phage 
display human antibodies. 

15 

jD. Candidate lipands obtained bv optical biosensor methods 

Proteins interacting with a polypeptide of the present invention, can also be screened by 
using an Optical Biosensor as described in Edwards and Leatherbarrow, (1997) Analytical 
Biochemistry, 246, 1-6 and also in Szabo et al. 9 (1995) Curr Opin Struct Biol 5, 699-705, the 

20 disclosures of which are incorporated by reference. This technique permits the detection of 

interactions between molecules in real time, without the need of labeled molecules. This technique 
is based on the surface plasmon resonance (SPR) phenomenon. Briefly, the candidate ligand 
molecule to be tested is attached to a surface (such as a carboxymethyl dextran matrix). A light 
beam is directed towards the side of the surface that does not contain the sample to be tested and is 

25 reflected by said surface. The SPR phenomenon causes a decrease in the intensity of the reflected 
light with a specific association of angle and wavelength. The binding of candidate ligand 
molecules cause a change in the refraction index on the surface, which change is detected as a 
change in the SPR signal. For screening of candidate ligand molecules or substances that are able 
to interact with the GENSET protein, or a fragment thereof, the GENSET protein, or a fragment 

30 thereof, is immobilized onto a surface. This surface comprises one side of a cell through which 
. flows the candidate molecule to be assayed. The binding of the candidate molecule on the 
GENSET protein, or a fragment thereof, is detected as a change of the SPR signal. The candidate 
molecules tested may be proteins, peptides, carbohydrates, lipids, or small molecules generated by 
combinatorial chemistry. This technique may also be performed by immobilizing eukaryotic or 

35 prokaryotic cells or lipid vesicles exhibiting an endogenous or a recombinantly expressed 
GENSET protein at their surface. 

The main advantage of the method is that it allows the determination of the association rate 
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between the GENSET protein and molecules interacting with the GENSET protein. It is thus 
possible to select specifically ligand molecules interacting with the GENSET protein, or a 
fragment thereof, through strong or conversely weak association constants. 

5 E. Candidate lizands obtained through a two-hvbrid screening assay. 

The yeast two-hybrid system is designed to study protein-protein interactions in vivo 
(Fields and Song, 1989), which disclosure is hereby incorporated by reference in its entirety, and 
relies upon the fusion of a bait protein to the DNA binding domain of the yeast Gal4 protein. This 
technique is also described in the US Patent No. US 5,667,973 and the US Patent No. 5,283,173, 
10 the technical teachings of both patents being herein incorporated by reference. 

The general procedure of library screening by the two-hybrid assay may be performed as 
described by Harper et al, (1993), Cell, 75 : 805-816 or as described by Cho et a/., (1998), Proc. 
Natl. Acad. Sci. USA, 95(7):3752-3757 or also Fromont-Racine et al, (1997), Nature Genetics, 
16(3) : 277-282, which disclosures are hereby incorporated by reference in their entireties. 
1 5 The bait protein or polypeptide comprises a polypeptide of the present invention. 

More precisely, the nucleotide sequence encoding the GENSET polypeptide or a fragment 
or variant thereof is fused to a polynucleotide encoding the DNA binding domain of the GAL4 
protein, the fused nucleotide sequence being inserted in a suitable expression vector, for example 
pAS2orpM3. 

20 Then, a human cDNA library is constructed in a specially designed vector, such that the human 
cDNA insert is fused to a nucleotide sequence in the vector that encodes the transcriptional domain 
of the GAL4 protein. Preferably, the vector used is the pACT vector. The polypeptides encoded 
by the nucleotide inserts of the human cDNA library are termed "prey" polypeptides. 

A third vector contains a detectable marker gene, such as beta galactosidase gene or CAT 
25 gene that is placed under the control of a regulation sequence that is responsive to the binding of a 
complete Gal4 protein containing both the transcriptional activation domain and the DNA binding 
domain. For example, the vector pG5EC may be used. 

Two different yeast strains are also used. As an illustrative but non limiting example the 
two different yeast strains may be the followings : 
30 - 190, the phenotype of which is (MATa, Leu2-3, 1 12 ura3-12, trpl-901, his3-D200, ade2- 
101, gal4Dgall80D URA3 GAL-LacZ, LYS G4L-HIS3, cyh 1 ); 

187, the phenotype of which is (MATa gal4 gal80 his3 trpl-901 ade2-101 ura3-52 leu2-3, 
-1 12 URA3 GAL-lacZmef ), which is the opposite mating type of Y190. 

Briefly, 20 ug of pAS2/GENSET and 20 ug of pACT-cDNA library are co-transformed 
35 into yeast strain Y190. The transformants are selected for growth on minimal media lacking 
histidine, leucine and tryptophan, but containing the histidine synthesis inhibitor 3-AT (50 iuM). 
Positive colonies are screened for beta galactosidase by filter lift assay. The double positive 
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colonies (His + , beta-gal*) are then grown on plates lacking histidine, leucine, but containing 
tryptophan and cycloheximide (10 mg/ml) to select for loss of pAS2/GENSET plasmids but 
retention of pACT-cDNA library plasmids. The resulting Y190 strains are mated with Yl 87 
strains expressing GENSET or non-related control proteins; such as cyclophilin B, lamin, or SNF1, 
5 as Gal4 fusions as described by Harper et al (1 993) and by Bram et a/., (1 993), Mol. Cell Biol, 
13:4760-4769, which disclosures are hereby incorporated by reference in their entireties, and 
screened for beta galactosidase by filter lift assay. Yeast clones that are beta gal- after mating with 
the control Gal4 fusions are considered false positives. 

In another embodiment of the two-hybrid method according to the invention, interaction 

10 between the GENSET or a fragment or variant thereof with cellular proteins may be assessed using 
the Matchmaker Two Hybrid System 2 (Catalog No. Kl 604-1 , Clontech). As described in the 
manual accompanying the kit, the disclosure of which is incorporated herein by reference, nucleic 
acids encoding the GENSET protein or a portion thereof, are inserted into an expression vector 
such that they are in frame with DNA encoding the DNA binding domain of the yeast 

1 5 transcriptional activator GAL4. A desired cDNA, preferably human cDNA, is inserted into a 
second expression vector such that they are in frame with DNA encoding the activation domain of 
GAL4. The two expression plasmids are transformed into yeast and the yeast are plated on selection 
medium which selects for expression of selectable markers on each of the expression vectors as well 
as GAL4 dependent expression of the HIS3 gene. Transformants capable of growing on medium 

20 lacking histidine are screened for GAL4 dependent lacZ expression. Those cells which are positive in 
both the histidine selection and the lacZ assay contain interaction between GENSET and the protein or 
peptide encoded by the initially selected cDNA insert. 

Compounds Modulating GENSET biological activity 

25 Another method of screening for compounds that modulate GENSET expression and/or 

biological activity is by measuring the effects of test compounds on specific biological activity, 
e.g. a GENSET biological activity in a host cell. In one embodiment, the present invention relates 
to a method of identifying an agent which alters GENSET biological activity, wherein a nucleic 
acid construct comprising a nucleic acid which encodes a mammalian GENSET polypeptide is 

30 introduced into a host cell. The host cells produced are maintained under conditions appropriate 
for expression of the encoded mammalian GENSET polypeptides, whereby the nucleic acid is 
expressed. The host cells are then contacted with a compound to be assessed (an "agent," or "test 
agent"), and the properties of the cells are assessed. Detection of a change in any GENSET 
polypeptide-associated property in the presence of the agent indicates that the agent alters 

35 GENSET activity. In a particular embodiment, the invention relates to a method of identifying an 
agent which is an activator of GENSET activity, wherein detection of an increase of any GENSET 
polypeptide-associated property in the presence of the agent indicates that the agent activates 
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GENSET activity. In another particular embodiment, the invention relates to a method of 
identifying an agent which is an inhibitor of GENSET activity, wherein detection of a decrease of 
any GENSET polypeptide-associated property in the presence of the agent indicates that the agent 
inhibits GENSET activity. 
5 In a particular embodiment, a high throughput screen can be used to identify agents that 

activate (enhance) or inhibit GENSET activity (See e.g., PCT publication WO 98/45438, which 
disclosure is hereby incorporated by reference in its entirety). For example, the method of 
identifying an agent which alters GENSET activity can be performed as follows. A nucleic acid 
construct comprising a polynucleotide which encodes a mammalian GENSET polypeptide is 

10 introduced into a host cell to produce recombinant host cells. The recombinant host cells are then 
maintained under conditions appropriate for expression of the encoded mammalian GENSET 
polypeptide, whereby the nucleic acid is expressed. The compound to be assessed is added to the 
recombinant host cells; the resulting combination is referred to as a test sample. A detectable, 
GENSET polypeptide-associated property of the cells is detected. A control can be used in the 

15 methods of detecting agents which alter GENSET activity. For example, the control sample 
includes the same reagents but lacks the compound or agent being assessed; it is treated in the 
same manner as the test sample. 

Methods of Screening for Compounds Modulating GENSET Expression and/or Activity 
20 The present invention also relates to methods of screening compounds for their ability to 

modulate (e.g. increase or inhibit) the activity or expression of GENSET. More specifically, the 
present invention relates to methods of testing compounds for their ability either to increase or to 
decrease expression or activity of GENSET. The assays are performed in vitro or in vivo. 
In vitro methods 

25 In vitro f cells expressing GENSET polypeptides are incubated in the presence and absence 

of the test compound. By determining the level of GENSET expression in the presence of the test 
compound or the level of GENSET activity in the presence of the test compound, compounds can 
be identified that suppress or enhance GENSET expression or activity. Alternatively, constructs 
comprising a GENSET regulatory sequence operably linked to a reporter gene (e.g. luciferase, 

30 chloramphenicol acetyl transferase, LacZ, green fluorescent protein, etc.) can be introduced into 
host cells and the effect of the test compounds on expression of the reporter gene detected. Cells 
suitable for use in the foregoing assays include, but are not limited to, cells having the same origin 
as tissues or cell lines in which the polypeptide has been determined to be expressed by methods 
common to the art such as discussed herein. Consequently, the present invention encompasses a 

35 method for screening molecules that modulate the expression of a GENSET gene, said screening 
method comprising the steps of: 

a) cultivating a prokaryotic or an eukaryotic cell that has been transfected 
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with a nucleotide sequence encoding a GENSET protein or a variant or a 
fragment thereof, placed under the control of its own promoter, 

b) bringing into contact said cultivated cell with a molecule to be tested; 

c) quantifying the expression of said GENSET protein or a variant or a 
5 fragment thereof in the presence of said molecule. 

Using DNA recombination techniques well known by the one skill in the art, the GENSET 
protein encoding DNA sequence is inserted into an expression vector, downstream from its 
promoter sequence. As an illustrative example, the promoter sequence of the GENSET gene is 
contained in the 5* untranscribed region of the GENSET genomic DNA. 

1 0 The quantification of the expression of a GENSET protein may be realized either at the 

mRNA level (using for example Northen blots, RT-PCR, preferably quantitative RT-PCR with 

. primers and probes specific for the GENSET mRNA of interest) or at the protein level (using 
polyclonal or monoclonal antibodies in immunoassays such as ELISA or RIA assays, Western 
blots, or immunochemistry). 

15 The present invention also concerns a method for screening substances or molecules that 

are able to increase, or in contrast to decrease, the level of expression of a GENSET gene. Such a 
method may allow the one skilled in the art to select substances exerting a regulating effect on the 
expression level of a GENSET gene and which may be useful as active ingredients included in 
pharmaceutical compositions for treating patients suffering from disorders associated with 

20 abnormal levels of GENSET products. 

Thus, another part of the present invention is a method for screening a candidate molecule that 
modulates the expression of a GENSET gene, this method comprises the following steps: 

a) providing a recombinant cell host containing a nucleic acid, wherein said 
nucleic acid comprises a GENSET 5' regulatory region or a regulatory 

25 active fragment or variant thereof, operably linked to a polynucleotide 

encoding a detectable protein; 

b) obtaining a candidate molecule; and 

c) determining the ability of said candidate molecule to modulate the 
expression levels of said polynucleotide encoding the detectable protein. 

30 In a further embodiment, said nucleic acid comprising a GENSET 5' regulatory region or a 

regulatory active fragment or variant thereof, includes the 5'UTR region of a GENSET cDNA 
selected from the group comprising of the 5'UTRs of the polynucleotide sequences of the 
Sequence Listing, those of human cDNA clone inserts of the deposited clone pool, regulatory 
active fragments, and variants thereof. In a more preferred embodiment of the above screening 

35 method, said nucleic acid includes a promoter sequence which is endogenous with respect to the 
GENSET 5'UTR sequence. In another more preferred embodiment of the above screening 
method, said nucleic acid includes a promoter sequence which is exogenous with respect to the 
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GENSET 5'UTR sequence defined therein. 

Preferred polynucleotides encoding a detectable protein are polynucleotides encoding beta 
galactosidase, green fluorescent protein (GFP) and chloramphenicol acetyl transferase (CAT). 

The invention further relates to a method for the production of a pharmaceutical 
5 composition comprising a method of screening a candidate molecule that modulates the expression 
of a GENSET gene and furthermore mixing the identified molecule with a physiologically 
acceptable carrier. 

The invention also pertains to kits for the screening of a candidate substance modulating 
the expression of a GENSET gene. Preferably, such kits comprise a recombinant vector that 
10 allows the expression of a GENSET 5' regulatory region or a regulatory active fragment or a 
variant thereof, operably linked to a polynucleotide encoding a detectable protein or a GENSET 
protein or a fragment or a variant thereof. More preferably, such kits include a recombinant vector 
that comprises a nucleic acid including the 5'UTR region of a GENSET cDNA selected from the 
group comprising the 5'UTRs of the polynucleotide sequences of the Sequence Listing, those of 
15 human cDNA clone inserts of the deposited clone pool, regulatory active fragments and variants 
thereof, being operably linked to a polynucleotide encoding a detectable protein. 

For the design of suitable recombinant vectors useful for performing the screening 
methods described above, it will be referred to the section of the present specification wherein the 
preferred recombinant vectors of the invention are detailed. 
20 Another object of the present invention comprises methods and kits for the screening of 

candidate substances that interact with a GENSET polypeptide, fragments or variants thereof. By 
' their capacity to bind covalently or non-covalently to a GENSET protein, fragments or variants 
thereof, these substances or molecules may be advantageously used both in vitro and in vivo. 

In vitro, said interacting molecules may be used as detection means in order to identify the 
25 presence of a GENSET protein in a sample, preferably a biological sample. 

A method for the screening of a candidate substance that interact with a GENSET 
polypeptide, fragments or variants thereof, said methods comprising the following steps: 

a) providing a polypeptide comprising, consisting essentially of, or 
consisting of a GENSET protein or a fragment comprising a contiguous 

30 span of at least 6 amino acids, preferably at least 8 to 10 amino acids, 

more preferably at least 12, 15, 20, 25, 30, 40, 50, or 100 amino acids of a 
polypeptide of the present invention; 

b) obtaining a candidate substance; 

c) bringing into contact said polypeptide with said candidate substance; 
35 d) detecting the complexes formed between said polypeptide and said 

candidate substance. 
The invention further relates to a method for the production of a pharmaceutical 
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composition comprising a method for the screening of a candidate substance that interact with a 
GENSET polypeptide, fragments or variants thereof and furthermore mixing the identified 
substance with a physiologically acceptable carrier. 

The invention further concerns a kit for the screening of a candidate substance interacting 
5 with the GENSET polypeptide, wherein said kit comprises: 

a) polypeptide comprising, consisting essentially of, or consisting of a 
GENSET protein or a fragment comprising a contiguous span of at least 6 
amino acids, preferably at least 8 to 10 amino acids, more preferably at 
least 12, 15, 20, 25, 30, 40, 50, or 100 amino acids of a polypeptide of the 

10 present invention; and 

b) optionally means useful to detect the complex formed between said 
polypeptide or a variant thereof and the candidate substance. 

In a preferred embodiment of the kit described above, the detection means comprises a 
monoclonal or polyclonal antibody binding to said GENSET protein or fragment or variant thereof. 

15 

In vivo methods 

Compounds that suppress or enhance GENSET expression can also be identified using in 
vivo screens. In these assays, the test compound is administered (e.g. intravenously, 
intraperitoneally, intramuscularly, orally, or otherwise), to the animal, for example, at a variety of 

20 dose levels. The effect of the compound on GENSET expression is determined by comparing 
GENSET levels, for example in tissues known to express the gene of interest using Northern blots, 
immunoassays, PCR, etc., as described above. Suitable test animals include, but are not limited to, 
rodents (e.g., mice and rats), primates, and rabbits. Humanized mice can also be used as test 
animals, that is mice in which the endogenous mouse protein is ablated (knocked out) and the 

25 homologous human protein added back by standard transgenic approaches. Such mice express 
only the human form of a protein. Humanized mice expressing only the human GENSET can be 
used to study in vivo responses to potential agents regulating GENSET protein or mRNA levels. 
As an example, transgenic mice have been produced carrying the human apoE4 gene. They are 
then bred with a mouse line that lacks endogenous apoE, to produce an animal model carrying 

30 human proteins believed to be instrumental in development of Alzheimer's pathology. Such 

transgenic animals are useful for dissecting the biochemical and physiological steps of disease, and 
for development of therapies for disease intervention (Loring, et al, 1996) (incorporated herein by 
reference in its entirety). 

35 Uses for compounds modulating GENSET expression and/or biological activity 

Using in vivo (or in vitro) systems, it may be possible to identify compounds that exert a 
tissue specific effect, for example, that increase GENSET expression or activity only in tissues of 
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interest, such as the adrenal gland, bone marrow, brain, cerebellum, colon, fetal brain, fetal kidney, 
fetal liver, heart, hypertrophic prostate, kidney, liver, lung, lymph ganglia, lymphocytes, muscle, 
ovary, pancreas, pituitary gland, placenta, prostate, salivary gland, spinal cord, spleen, stomach, 
intestine, substantia nigra., testis, thyroid, umbilical cord, and uterus. Screening procedures such as 

5 those described above are also useful for identifying agents for their potential use in 

pharmacological intervention strategies. Agents that enhance GENSET gene expression or 
stimulate its activity may thus be used to induce any phenotype associated with a GENSET gene, 
or to treat disorders resulting from a deficiency of a GENSET polypeptide activity or expression. 
Compounds that suppress GENSET polypeptide expression or inhibit its activity can be used to 

1 0 treat any disease or condition associated with increased or deleterious GENSET polypeptide 
activity or expression. 

Also encompassed by the present invention is an agent which interacts with a GENSET 
gene or polypeptide directly or indirectly, and inhibits or enhances GENSET polypeptide 
expression and/or function. In one embodiment, the agent is an inhibitor which interferes with a 

15 GENSET polypeptide directly (e.g., by binding the GENSET polypeptide) or indirectly (e.g., by 
blocking the ability of the GENSET polypeptide to have a GENSET biological activity). In a 
particular embodiment, an inhibitor of a GENSET protein is an antibody specific for the GENSET 
protein or a functional portion of the GENSET protein; that is, the antibody binds a GENSET 
polypeptide. For example, the antibody can be specific for a polypeptide encoded by one of the 

20 nucleic acid sequences of human GENSET nucleic acids, a mammalian GENSET nucleic acid, or 
portions thereof. Alternatively, the inhibitor can be an agent other than an antibody (e.g., small 
organic molecule, protein or peptide) which binds the GENSET polypeptide and blocks its activity. 
For example, the inhibitor can be an agent which mimics the GENSET polypeptide structurally, 
but lacks its function. Alternatively, it can be an agent which binds to or interacts with a molecule 

25 . which the GENSET polypeptide normally binds to or interacts with, thus blocking the GENSET 

• polypepetide from doing so and preventing it from exerting the effects it would normally exert. 

In another embodiment, the agent is an enhancer (activator) of a GENSET polypeptide 
which increases the activity of the GENSET polypeptide (increases the effect of a given amount or 
level of GENSET), increases the length of time it is effective (by preventing its degradation or 

30 otherwise prolonging the time during which it is active) or both either directly or indirectly. For 
example, GENSET polynucleotides and polypeptides can be used to identify drugs which increase 
or decrease the ability of GENSET polypeptides to induce GENSET biological activity, which 
drugs are useful for the treatment or prevention of any disease or condition associated with a 
GENSET biological activity. 

35 The GENSET sequences of the present invention can also be used to generate nonhuman 

gene knockout animals, such as mice, which lack a GENSET gene or transgenically overexpress a 
GENSET gene. For example, such GENSET gene knockout mice can be generated and used to 
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obtain further insight into the function of the GENSET gene as well as assess the specificity of 
GENSET activators and inhibitors. Also, over expression of the GENSET gene (e.g., a human 
GENSET gene) in transgenic mice can be used as a means of creating a test system for GENSET 
activators and inhibitors (e.g., against a human GENSET polypeptide). In addition, the GENSET 
5 gene can be used to clone the GENSET promoter/enhancer in order to identify regulators of 
GENSET gene transcription. GENSET gene knockout animals include animals which completely 
or partially lack the GENSET gene and/or GENSET activity or function. Thus the present 
invention relates to a method of inhibiting (partially or completely) a GENSET biological activity 
in a mammal (e.g., a human), the method comprising administering to the mammal an effective 
10 amount of an inhibitor of a GENSET polypeptide or polynucleotide. The invention also relates to 
a method of enhancing a GENSET biological activity in a mammal, the method comprising 
administering to the mammal an effective amount of an enhancer of a GENSET polypeptide or 
polynucleotide. 

1 5 Inhibiting GENSET gene expression 

Therapeutic compositions according to the present invention may comprise advantageously 
one or several GENSET oligonucleotide fragments as an antisense tool or a triple helix tool that 
inhibits the expression of the corresponding GENSET gene. 
Antisense Approach 

20 In antisense approaches, nucleic acid sequences complementary to an mRNA are 

hybridized to the mRNA intracellularly, thereby blocking the expression of the protein encoded by 
the mRNA. The antisense nucleic acid molecules to be used in gene therapy may be either DNA 
or RNA sequences. Preferred methods using antisense polynucleotide according to the present 
. invention are the procedures described by Sczakiel et aL, (1995) Trends Microbiol. 3(6):213-217, 

25 which disclosure is hereby incorporated by reference in its entirety. . 

Preferably, the antisense tools are chosen among the polynucleotides (15-200 bp long) that 
are complementary to GENSET mRNA, more preferably to the 5 'end of the GENSET mRNA. In 
another embodiment, a combination of different antisense polynucleotides complementary to 
different parts of the desired targeted gene are used. 

30 Other preferred antisense polynucleotides according to the present invention are sequences 

complementary to either a sequence of GENSET mRNAs comprising the translation initiation 
codon ATG or a sequence of GENSET genomic DNA containing a splicing donor or acceptor site. 
Preferably, the antisense polynucleotides of the invention have a V polyadenylation signal that has 
been replaced with a self-cleaving ribozyme sequence, such that RNA polymerase n transcripts are 

35 produced without poly(A) at their 3 * ends, these antisense polynucleotides being incapable of 
export from the nucleus, such as described by Liu et al (1994), Proc. Natl. Acad. Sci. USA. 91: 
4528-4262, which disclosure is hereby incorporated by reference in its entirety. In a preferred 
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embodiment, these GENSET antisense polynucleotides also comprise, within the ribozyme 
cassette, a histone stem-loop structure to stabilize cleaved transcripts against 3'-5' exonucleolytic 
degradation, such as the structure described by Eckner et al. 9 (1991) EMBO J. 10:3513-3522, 
which disclosure is hereby incorporated by reference in its entirety. 
5 The antisense nucleic acids should have a length and melting temperature sufficient to 

permit formation of an intracellular duplex having sufficient stability to inhibit the expression of 
the GENSET mRNA in the duplex. Strategies for designing antisense nucleic acids suitable for 
use in gene therapy are disclosed in Green et al, (1986) Ann. Rev. Biochem. 55:569-597 and Izant 
and Weintraub, (1984) Cell 36(4): 1007-1 5, the disclosures of which are incorporated herein by 
10 reference. 

In some strategies, antisense molecules are obtained by reversing the orientation of the 
GENSET coding region with respect to a promoter so as to transcribe the opposite strand from that 
which is normally transcribed in the cell. The antisense molecules may be transcribed using in 
vitro transcription systems such as those which employ T7 or SP6 polymerase to generate the 

1 5 transcript. Another approach involves transcription of GENSET antisense nucleic acids in vivo by 
operably linking DNA containing the antisense sequence to a promoter in a suitable expression 
vector. Alternatively, oligonucleotides which are complementary to the strand normally ' 
transcribed in the cell may be synthesized in vitro. Thus, the antisense nucleic acids are 
complementary to the corresponding mRNA and are capable of hybridizing to the mRNA to create 

20 a duplex. 

Specific examples of preferred antisense compounds useful in this invention include 
• oligonucleotides containing modified backbones or non-natural internucleoside linkages. As 
defined in this specification, oligonucleotides having modified backbones include those that retain 
a phosphorus atom in the backbone and those that do not have a phosphorus atom in the backbone. 

25 For the purposes of this specification, and as sometimes referenced in the art, modified 

oligonucleotides that do not have a phosphorus atom in their internucleoside backbone can also be 
considered to be oligonucleosides. 

Preferred modified oligonucleotide backbones include, for example, phosphorothioates, 
chiral phosphorothioates, phosphorodithioates, phosphotriesters, aminoalkylphosphotriesters, 

30 methyl and other alkyl phosphonates including 3-alkylene phosphonates and chiral phosphonates, 
phosphinates, phosphoramidates including 3'-amino phosphoramidate and 
ammoalkylphosphoramidates, thionophosphoramidates, thionoalkylphosphonates, 
thionoalkylphosphotriesters, and boranophosphates having normal 3-5" linkages, 2'-5 f linked 
analogs of these, and those having inverted polarity wherein the adjacent pairs of nucleoside units 

35 are linked 3-5' to 5-3 'or 2-5' to 5 ! -2 ! . Various salts, mixed salts and free acid forms are also 
included. 

Preferred modified oligonucleotide backbones that do not include a phosphorus atom 
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therein have backbones that are formed by short chain alky] or cycloalkyl intemucleoside linkages, 
mixed heteroatom and alkyl or cycloalkyl intemucleoside linkages, or one or more short chain 
heteroatomic or heterocyclic intemucleoside linkages. These include those having morpholino 
linkages (formed in part from the sugar portion of a nucleoside); siloxane backbones; sulfide, 

5 sulfoxide and sulfone backbones; formacetyl and thioformacetyl backbones; methylene formacetyl 
and thioformacetyl backbones; alkene containing backbones; sulfamate backbones; 
methyleneimino and methylenehydrazino backbones; sulfonate and sulfonamide backbones; amide 
backbones; and others having mixed N, O, S and CH.sub.2 component parts. 

In other preferred oligonucleotide mimetics, both the sugar and the intemucleoside linkage, 

10 i.e., the backbone, of the nucleotide units are replaced with novel groups. The base units are 
maintained for hybridization with an appropriate nucleic acid target compound. Tne such 
oligomeric compound, an oligonucleotide mimetic that has been shown to have excellent 
hybridization properties, is referred to as a peptide nucleic acid (PNA). In PNA compounds, the 
sugar-backbone of an oligonucleotide is replaced with an amide containing backbone, in particular 

15 an aminoethylglycine backbone. The nucleobases are retained and are bound directly or indirectly 
to aza nitrogen atoms of the amide portion of the backbone. 

Oligonucleotides may also include nucleobase (often referred to in the art simply as 
"base") modifications or substitutions. As used herein, "unmodified" or "natural" nucleobases 
include the purine bases adenine (A) and guanine (G), and the pyrimidine bases thymine (T), 

20 cytosine (C) and uracil (U). Modified nucleobases include other synthetic and natural nucleobases 
such as 5-methylcytosine (5-me-C), 5-hydroxymethyl cytosine, xanthine, hypoxanthine, 2- 
aminoadenine, 6-methyl and other alkyl derivatives of adenine and guanine, 2-propyl and other 
alkyl derivatives of adenine and guanine, 2-thiouracil, 2-thiothymine and 2-thiocytosine, 5- 
halouracil and cytosine, 5-propynyl uracil and cytosine, 6-azo uracil, cytosine and thymine, 5- 

25 uracil (pseudouracil), 4-thiouracil, 8-halo, 8-amino, 8-thiol, 8-thioalkyl, 8-hydroxyl and other 8- 
substituted adenines and guanines, 5 -halo particularly 5-bromo, 5-trifluoromethyl and other 5- 
substituted uracils and cytosines, 7-methylguanine and 7-methyladenine, 8-azaguanine and 8- 
azaadenine, 7-deazaguanine and 7-deazaadenine and 3-deazaguanine and 3-deazaadenine. Further 
nucleobases include those disclosed in U.S. Pat. No. 3,687,808, those disclosed in The Concise 

30 Encyclopedia Of Polymer Science And Engineering, pages 858-859, Kroschwitz, J. L, ed. John 
Wiley & Sons, 1990, those disclosed by Englisch et al., Angewandte Chemie, International 
Edition, 1991, 30, 613, and those disclosed by Sanghvi, Y. S., Chapter 15, Antisense Research and 
Applications, pages 289-302, Crooke, S. T. and Lebleu, B. ed., CRC Press, 1993. Certain of these 
nucleobases are particularly useful for increasing the binding affinity of the oligomeric compounds 

35 of the invention. These include 5-substituted pyrimidines, 6-azapyrimidines and N-2, N-6 and 0-6 
substituted purines, including 2-aminopropyladenine, 5-propynyluracil and 5-propynylcytosine. 5- 
methylcytosine substitutions have been shown to increase nucleic acid duplex stability by 0.6- 
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1.2.degree. C. (Sanghvi, Y. S., Crooke, S. T. and Lebleu, B., eds., Antisense Research and 
Applications, CRC Press, Boca Raton, 1993, pp. 276-278) and are presently prefetfed base 
substitutions, even more particularly when combined with 2 , -0-methoxyethyl sugar modifications 
(U.S. Patent 6,242,590, hereby incorporated by reference). 
5 Various types of antisense oligonucleotides complementary to the sequence of the 

GENSET cDNA or genomic DNA may be used. In one preferred embodiment, stable and semi- 
stable antisense oligonucleotides described in International Application No. PCT WO94/23026, 
hereby incorporated by reference, are used. In these molecules, the 3' end or both the 3' and 5' 
ends are engaged in intramolecular hydrogen bonding between complementary base pairs. These 

10 molecules are better able to withstand exonuclease attacks and exhibit increased stability compared 
to conventional antisense oligonucleotides. 

In another preferred embodiment, the antisense oligodeoxynucleotides against herpes 
simplex virus types 1 and 2 described in International Application No. WO 95/04141, hereby 
incorporated by reference, are used. 

15 ■ In yet another preferred embodiment, the covalently cross-linked antisense 

oligonucleotides described in International Application No. WO 96/3 1 523, hereby incorporated by 
reference, are used. These double- or single-stranded oligonucleotides comprise one or more, 
respectively, inter- or intra-oligonucleotide covalent cross-linkages, wherein the linkage consists of 
■ an amide bond between a primary amine group of one strand and a carboxyl group of the other 

20 strand or of the same strand, respectively, the primary amine group being directly substituted in the 
2' position of the strand nucleotide monosaccharide ring, and the carboxyl group being carried by 
an aliphatic spacer group substituted on a nucleotide or nucleotide analog of the other strand or the 
same strand, respectively. 

The antisense oligodeoxynucleotides and oligonucleotides disclosed in International 

25 Application No. WO 92/1 8522, incorporated by reference, may also be used. These molecules are 
stable to degradation and contain at least one transcription control recognition sequence which 
binds to control proteins and are effective as decoys therefor. These molecules may contain 
"haiipin" structures, "dumbbell" structures, "modified dumbbell" structures, "cross-linked" decoy 
structures and "loop" structures. 

30 In another preferred embodiment, the cyclic double-stranded oligonucleotides described in 

European Patent Application No. 0 572 287 A2, hereby incorporated by reference are used. These 
ligated oligonucleotide "dumbbells" contain the binding site for a transcription factor and inhibit 
expression of the gene under control of the transcription factor by sequestering the factor. 

Use of the closed antisense oligonucleotides disclosed in International Application No. 

35 WO 92/19732, hereby incorporated by reference, is also contemplated. Because these molecules 
have no free ends, they are more resistant to degradation by exonucleases than are conventional 
oligonucleotides. These oligonucleotides may be multifunctional, interacting with several regions 



355 



WO 02/094864 PCT/IB01/01715 

which are not adjacent to the target mRNA. 

Another modification of the oligonucleotides of the invention involves chemically linking 
to the oligonucleotide one or more moieties or conjugates which enhance the activity, cellular 
distribution or cellular uptake of the oligonucleotide. Such moieties include but are not limited to 
5 lipid moieties such as a cholesterol moiety (Letsinger et al., Proc. Natl. Acad. Sci. USA (1989) 86: 
6553-6556), cholic acid (Manoharan et al., Bioorg. Med. Chem. Let. (1994) 4:1053-1060), a 
thioether, e.g., hexyl-S-tritylthiol (Manoharan et al., Ann. N.Y. Acad. Sci. (1992) 660:306-309; 
Manoharan et al., Bioorg. Med. Chem. Let. (1993) 3:2765-2770), a thiocholesterol (Oberhauser et 
al., Nucl. Acids Res. (1992) 20:533-538), an aliphatic chain, e.g., dodecandiol or undecyl residues 

10 (Saison-Behmoaras et al., EMBO J. (1991) 10:1 111-1118; Kabanov et al., FEBS Lett. (1990) 259: 
327-330; Svinarchuk et al., Biochimie (1993) 75:49-54), a phospholipid, e.g., di-hexadecyl-rac- 
glycerol or triethylammonium l,2-di-0-hexadecyl-rac-glycero-3-H-phosphonate (Manoharan et 
al., Tetrahedron Lett., 1995, 36, 3651-3654; Shea et al., Nucl. Acids Res. (1990) 18:3777-3783), a 
polyamine or a polyethylene glycol chain (Manoharan et al., Nucleosides & Nucleotides (1995) 14: 

15 969-973), or adamantane acetic acid (Manoharan et al., Tetrahedron Lett. (1995) 36:365 1 -3654), a 
palmityl moiety (Mishra et al., Biochim. Biophys. Acta (1995) 1264:229-237), or an 
octadecylamine or hexylamino-carbonyl-oxycholesterol moiety (Crooke et al., J. Pharmacol. Exp. 
Ther. (1996) 277:923-937; U.S. Patent 6,242,590, which disclosures are hereby incorporated by 
reference in their entireties 

20 It is not necessary for all positions in a given compound to be uniformly modified, and in 

fact more than one of the aforementioned modifications may be incorporated in a single compound 
or even at a single nucleoside within an oligonucleotide. The present invention also includes 
antisense compounds which are chimeric compounds. "Chimeric" antisense compounds or 
"chimeras," in the context of this invention, are antisense compounds, particularly 

25 oligonucleotides, which contain two or more chemically distinct regions, each made up of at least 
one monomer unit, i.e., a nucleotide in the case of an oligonucleotide compound. These 
oligonucleotides typically contain at least one region wherein the oligonucleotide is modified so as 
to confer upon the oligonucleotide increased resistance to nuclease degradation, increased cellular 
uptake, and/or increased binding affinity for the target nucleic acid. An additional region of the 

30 oligonucleotide may serve as a substrate for enzymes capable of cleaving RNA:DNA or 

RNA:RNA hybrids. By way of example, RNase H is a cellular endonuclease which cleaves the 
RNA strand of an RNA:DNA duplex. Activation of RNase H, therefore, results in cleavage of the 
RNA target, thereby greatly enhancing the efficiency of oligonucleotide inhibition of gene 
expression. Consequently, comparable results can often be obtained with shorter oligonucleotides 

35 when chimeric oligonucleotides are used, compared to phosphorothioate deoxyoligonucleotides 
hybridizing to the same target region. Cleavage of the RNA target can be routinely detected by gel 
electrophoresis and, if necessary, associated nucleic acid hybridization techniques known in the art 
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(U.S. Patent 6,242,590, hereby incorporated by reference). 

Further included in the present invention is a method of high throughput screening of 
antisense nucleic acids and modified versions thereof for binding to targeted GENSET 
polynucleotide sequences or fragments thereof. This method is directed toward determining 
5 optimally targeted sequences and/ or optimal species of targeting antisense molecules for binding. 
A preferred method comprises the steps of: contacting a random pool of test molecules with a set 
array of GENSET polynucleotide sequences or fragments thereof; detecting and quantifying 
binding of test molecules to said array; and purification and identification of binding test molecules 
as discussed in U.S. Patent 6,022,691, which disclosure is hereby incorporated by reference. 

10 Preferred test molecules are antisense oligonucleotides, oligonucleosides, and modified versions 
thereof as discussed herein. Further preferred test molecules are those that are capable of forming 
hydrogen bonds with GENSET polynucleotide sequences or fragments thereof. 

The appropriate level of antisense nucleic acids required to inhibit gene expression may be 
determined using in vitro expression analysis. The antisense molecule may be introduced into the 

15 cells by diffusion, injection, infection or transfection using procedures known in the art. For 
example, the antisense nucleic acids can be introduced into the body as a bare or naked 
oligonucleotide, oligonucleotide encapsulated in lipid, oligonucleotide sequence encapsidated by 
viral protein, or as an oligonucleotide operably linked to a promoter contained in an expression 
vector. The expression vector may be any of a variety of expression vectors known in the art, 

20 including retroviral or viral vectors, vectors capable of extrachromosomal replication, or 
integrating vectors. The vectors may be DNA or RNA. 

The antisense compounds of the invention encompass any physiologically acceptable salts, 
esters, or salts of such esters, or any other compound which, upon administration to an animal 
including a human, is capable of providing (directly or indirectly) the biologically active 

25 metabolite or residue thereof. Accordingly, for example, the disclosure is also drawn to prodrugs 
and physiologically acceptable salts of the compounds of the invention, physiologically acceptable 
salts of such prodrugs, and other bioequivalents as discussed herein. 

The antisense molecules are introduced onto cell samples at a number of different 
concentrations preferably between lxl0" 1D M to IxlO^M. Once the minimum concentration that 

30 can adequately control gene expression is identified, the optimized dose is translated into a dosage 
suitable for use in vivo. For example, an inhibiting concentration in culture of lxl 0" 7 translates 
into a dose of approximately 0.6 mg/kg bodyweight. Levels of oligonucleotide approaching 100 
mg/kg bodyweight or higher may be possible after testing the toxicity of the oligonucleotide in 
laboratory animals. It is additionally contemplated that cells from the vertebrate are removed, 

35 treated with the antisense oligonucleotide, and reintroduced into the vertebrate. 

hi a preferred application of this invention, the polypeptide encoded by the gene is first 
identified, so that the effectiveness of antisense inhibition on translation can be monitored using 
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techniques that include but are not limited to antibody-mediated tests such as RIAs and ELISA, 
functional assays, or radiolabeling. 

An alternative to the antisense technology that is used according to the present invention 
comprises using ribozymes that will bind to a target sequence via their complementary 
5 polynucleotide tail and that will cleave the corresponding RNA by hydrolyzing its target site 
(namely "hammerhead ribozymes"). Briefly, the simplified cycle of a hammerhead ribozyme 
comprises (1) sequence specific binding to the target RNA via complementary antisense 
sequences; (2) site-specific hydrolysis of the cleavable motif of the target strand; and (3) release of 
cleavage products, which gives rise to another catalytic cycle. Indeed, the use of long-chain 

10 antisense polynucleotide (at least 30 bases long) or ribozymes with long antisense arms are 
advantageous. A preferred delivery system for antisense ribozyme is achieved by covalently 
linking these antisense ribozymes to lipophilic groups or to use liposomes as a convenient vector. 
Preferred antisense ribozymes according to the present invention are prepared as described by 
Rossi et aL, (1991) Pharmacol. Ther. 50:245-254 and Sczakiel et al (1995), the specific 

1 5 preparation procedures being referred to in said articles being herein incorporated by reference. 
Triple Helix Approach 

The GENSET genomic DNA may also be used to inhibit the expression of the GENSET 
gene based on intracellular triple helix formation. 

Triple helix oligonucleotides are used to inhibit transcription from a genome. They are 

20 particularly useful for studying alterations in cell activity when it is associated with a particular 
gene. The GENSET cDNAs or genomic DNAs of the present invention or, more preferably, a 
fragment of those sequences, can be used to inhibit gene expression in individuals having diseases 
associated with expression of a particular gene. Similarly, a portion of the GENSET genomic 
DNA can be used to study the effect of inhibiting GENSET gene transcription within a cell. 

25 Traditionally, homopurine sequences were considered the most useful for triple helix strategies. 
However, homopyrimidine sequences can also inhibit gene expression. Such homopyrirnidine 
oligonucleotides bind to the major groove at homopurineihomopyrimidine sequences. Thus, both 
types of sequences from the GENSET genomic DNA are contemplated within fee scope of this 
. invention. 

30 To carry out gene therapy strategies using the triple helix approach, the sequences of the 

GENSET genomic DNA are first scanned to identify 10-mer to 20-mer homopyrimidine or 
homopurine stretches which could be used in triple-helix based strategies for inhibiting GENSET 
expression. Following identification of candidate homopyrimidine or homopurine stretches, their 
efficiency in inhibiting GENSET expression is assessed by introducing varying amounts of 

35 oligonucleotides containing the candidate sequences into tissue culture cells which express the 
GENSET gene. 

The oligonucleotides can be introduced into fee cells using a variety of methods known to those 
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skilled in the art, including but not limited to calcium phosphate precipitation, DEAE-Dextran, 
electroporation, liposome-mediated transfection or native uptake. 

Treated cells are monitored for altered cell function or reduced GENSET expression using 
techniques such as Northern blotting, RNase protection assays, or PGR based strategies to monitor 
5 the transcription levels of the GENSET gene in cells which have been treated with the 

oligonucleotide. The cell functions to be monitored are predicted based upon the homologies of 
the target gene corresponding to the cDNA from which the oligonucleotide was derived with 
known gene sequences that have been associated with a particular function. The cell functions can 
also be predicted based on the presence of abnormal physiology within cells derived from 

10 individuals with a particular inherited disease, particularly when the cDNA is associated with the 
disease using techniques described in the section entitled "Identification of genes associated with 
hereditary diseases or drug response". 

The oligonucleotides which are effective in inhibiting gene expression in tissue culture 
cells may then be introduced in vivo using the techniques and at a dosage calculated based on the 

15 in vitro results, as described in the section entided "Antisense Approach". 

In some embodiments, the natural (beta) anomers of the oligonucleotide units can be 
replaced with alpha anomers to render the oligonucleotide more resistant to nucleases. Further, an 
intercalating agent such as ethidium bromide, or the like, can be attached to the 3' end of the alpha 
oligonucleotide to stabilize the triple helix. For information on the generation of oligonucleotides 

20 suitable for triple helix formation. See Griffin et a/., (1989) Science 245:967-971, which is hereby 
incorporated by this reference. 

Treating GENSET gene-related disorders 

The present invention further relates to methods, uses of GENSET polypeptides and 
■ 25 polynucleotides, and uses of modulators of GENSET polypeptides and polynucleotides, for 

treating diseases/disorders associated with GENSET genes by increasing or decreasing GENSET 
gene activity and/or expression. These methodologies can be effected using compounds selected 
using screening protocols such as those described herein and/or by using the gene therapy and 
antisense approaches described in the art and herein. Gene therapy can be used to effect targeted 

30 expression of GENSET genes in any tissue, e.g. a tissue associated with the disease or condition to 
be treated. The GENSET coding sequence can be cloned into an appropriate expression vector and 
targeted to a particular cell type(s) to achieve efficient, high level expression. Introduction of the 
GENSET coding sequence into target cells can be achieved, for example, using particle mediated 
DNA delivery, [Haynes et ai, (1996) J Biotechnol. 44(l-3):37-42 and Maurer et a/., (1999) Mol 

35 Membr Biol. 16(1): 129-40], direct injection of naked DNA, [Levy et al y (1996) Gene Then 

3(3):201-1 1; and Feigner (1996) Hum Gene Ther. 7(15): 1791-3], or viral vector mediated transport 
[Smith et aL, (1996) Antiviral Res. 32(2):99-115, Stone et a/., (2000) J Endocrinol. 164(2):103-18; 
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Wu and Ataai (2000), Curr Opin Biotechnol. 1 1(2):205«8], each of which disclosures are hereby 
incorporated by reference in their entireties. Tissue specific effects can be achieved, for example, 
in the case of virus mediated transport by using viral vectors that are tissue specific, or by the use 
of promoters that are tissue specific. For instance, any tissue-specific promoter may be used to 
5 achieve specific expression, for example albumin promoters (liver specific; Pinkert et al., 1987 
Genes Dev. 1:268-277), lymphoid specific promoters (Calame et aL, 1988 Adv. Immunol. 43:235- 
275), promoters of T-cell receptors (Winoto et al., 1989 EMBO J. 8:729-733) and 
immunoglobulins (Banerji et al., 1983 Cell 33:729-740; Queen and Baltimore 1983 Cell 33:741- 
748), neuron-specific promoters (e.g. the neurofilament promoter; Byrne et al., 1989 Proc. Natl. 

10 Acad. Sci. USA 86:5473-5477), pancreas-specific promoters (Edlunch et al., 1985 Science 
230:912-916) or mammary gland-specific promoters (milk whey promoter, U.S. Pat. No. 
4,873,316 and European Application Publication No. 264, 166). Developmentally-regulated 
promoters can also be used, such as the murine homeobox promoters (Kessel et al., 1990 Science 
249:374-379) or the alpha-fetoprotein promoter (Campes et al., 1989 Genes Dev. 3:537-546). 

15 Combinatorial approaches can also be used to ensure that the GENSET coding sequence is 

activated in the target tissue [Butt and Karathanasis (1995) Gene Expr. 4(6):319-36; Miller and 
Whelan, (1997) Hum Gene Ther. 8(7):803-15], which disclosures are hereby incorporated by 
reference in their entireties. Antisense oligonucleotides complementary to GENSET mRNA can 
be used to selectively diminish or ablate the expression of the protein, for example, at sites of 

20 inflammation. More specifically, antisense constructs or antisense oligonucleotides can be used to 
inhibit the production of GENSET in high expressing cells such as determined by methods 
common to the art or discussed herein. Antisense mRNA can be produced by transfecting into 
target cells an expression vector with the GENSET gene sequence, or a portion thereof, oriented in 
an antisense direction relative to the direction of transcription. Appropriate vectors include viral 

25 vectors, including retroviral, adenoviral, and adeno-associated viral vectors, as well as nonviral 
vectors. Tissue specific promoters can be used, as described supra. Alternatively, antisense 
oligonucleotides can be introduced directly into target cells to achieve the same goal. (See also 
other delivery methodologies described herein in connection with gene therapy.). Oligonucleotides 
can be selected/designed to achieve a high level of specificity [Wagner, et al. (1996), Nat 

30 Biotechnol. 14(7):840-4], which disclosure is hereby incorporated by reference in its entirety. The 
therapeutic methodologies described herein are applicable to both human and non-human 
mammals (including cats and dogs). 

Pharmaceutical and physiologically acceptable compositions 
35 The present invention also relates to pharmaceutical or physiologically acceptable 

compositions comprising, as active agent, the polypeptides, nucleic acids or antibodies of the 
invention. The invention also relates to compositions comprising, as active agent, compounds 
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selected using the above-described screening protocols. Such compositions include the active 
agent in combination with a pharmaceutical or physiologically acceptable carriers such as a 
physiologically acceptable salt, ester, or salt of such esters. In the case of naked DNA, the 
"carrier" may be gold particles. The amount of active agent in the composition can vary with the 
5 agent, the patient and the effect sought. Likewise, the dosing regimen can vary depending on the 
composition and the disease/disorder to be treated. 

Therefore, the invention related to methods for the production of pharmaceutical 
composition comprising a method for selecting an active agent, compound, substance or molecule 
using any of the screening method described herein and furthermore mixing the identified active 

10 agent, compound, substance or molecule with a physiologically acceptable carrier. 

The term "physiologically acceptable salts" refers to physiologically and pharmaceutically 
acceptable salts of the compounds of the invention: i.e., salts that retain the desired biological 
activity of the parent compound and do not impart undesired toxicological effects thereto. 
Physiologically acceptable base addition salts are formed with metals or amines, such as alkali and 

15 alkaline earth metals or organic amines. Examples of metals used as cations are sodium, potassium, . 
magnesium, calcium, and the like. Examples of suitable amines are N.N'-dibenzylethylenediamine, 
chloroprocaine, choline, diethanolamine, dicyclohexylamine, ethylenediamine, N- 
methylglucamine, and procaine (see, for example, Berge et al., "Pharmaceutical Salts," J. of 
Pharma Sci. (1977) 66:1-19). The base addition salts of said acidic compounds are prepared by 

20 contacting the free acid form with a sufficient amount of the desired base to produce the salt in the 
conventional manner. The free acid form may be regenerated by contacting the salt form with an 
acid and isolating the free acid in the conventional manner. The free acid forms differ from their 
respective salt forms somewhat in certain physical properties such as solubility in polar solvents, 
but otherwise the salts are equivalent to their respective free acid for purposes of the present 

25 invention. As used herein, a "pharmaceutical addition salt" includes a physiologically acceptable 
salt of an acid form of one of the components of the compositions of the invention. These include 
organic or inorganic acid salts of the amines. Preferred acid salts are the hydrochlorides, acetates, 
salicylates, nitrates and phosphates. Other suitable physiologically acceptable salts are well known 
to those skilled in the art and include basic salts of a variety of inorganic and organic acids, such 

30 as, for example, with inorganic acids, such as for example hydrochloric acid, hydrobromic acid, 
sulfuric acid or phosphoric acid; with organic carboxylic, sulfonic, sulfo or phospho acids or N- 
substituted sulfamic acids, for example acetic acid, propionic acid, glycolic acid, succinic afcid, 
maleic acid, hydroxymaleic acid, methylmaleic acid, fumaric acid, malic acid, tartaric acid, lactic 
acid, oxalic acid, gluconic acid, glucaric acid, glucuronic acid, citric acid, benzoic acid, cinnamic 

35 acid, mandelic acid, salicylic acid, ^aminosalicylic acid, 2-phenoxybenzoic acid, 2- 

acetoxybenzoic acid, embonic acid, nicotinic acid or isonicotinic acid; and with amino acids, such 
as the 20 alpha-amino acids involved in the synthesis of proteins in nature, for example glutamic 
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acid or aspartic acid, and also with phenylacetic acid, methanesulfonic acid, ethanesulfonic acid, 2- 
hydroxyethanesulfonic acid, ethane-l,2-disulfonic acid, benzenesulfonic acid, 4- 
methylbenzenesulfonic acid, naphthalene-2-sulfonic acid, naphthalene-l,5-disulfonic acid, 2- or 3- 
phosphoglycerate, glucose-6-phosphate, N-cyclohexylsulfamic acid (with the formation of 
5 cyclamates), or with other acid organic compounds, such as ascorbic acid. Physiologically 
acceptable salts of compounds may also be prepared with a physiologically acceptable cation. 
Suitable physiologically acceptable cations are well known to those skilled in the art and include 
alkaline, alkaline earth, ammonium and quaternary ammonium cations. Carbonates or hydrogen 
carbonates are also possible. For oligonucleotides, preferred examples of physiologically 

10 acceptable salts include but are not limited to (a) salts formed with cations such as sodium, 
potassium, ammonium, magnesium, calcium, polyamines such as spermine and spermidine, etc.; 
(b) acid addition salts formed with inorganic acids, for example hydrochloric acid, hydrobromic 
acid, sulfuric acid, phosphoric acid, nitric acid and the like; (c) salts formed with organic acids 
such as, for example, acetic acid, oxalic acid, tartaric acid, succinic acid, maleic acid, fumaric acid, 

15 gluconic acid, citric acid, malic acid, ascorbic acid, benzoic acid, tannic acid, palmitic acid, alginic 
acid, polyglutamic acid, naphthalenesulfonic acid, methanesulfonic acid, p-toluenesulfonic acid, 
naphthalenedisulfonic acid, polygalacturonic acid, and the like; and (d) salts formed from 
elemental anions such as chlorine, bromine, and iodine. 

The term "prodrug" indicates a therapeutic agent that is prepared in an inactive form that is 

20 converted to an active form (i.e., drug) within the body or cells thereof by the action of endogenous 
en2ymes or other chemicals and/or conditions. In particular, prodrug versions of the . 
oligonucleotides of the invention are prepared as SATE [(S-acetyl-2-thioethyl) phosphate] 
derivatives according to the methods disclosed in WO 93/24510 to Gosselin et al., published Dec. 
9, 1993 or in WO 94/26764 and U.S. Pat. No. 5,770,713 to inbach ,et al. 

25 The pharmaceutical compositions utilized in this invention may be administered by any 

number of routes including, but not limited to: parenteral, intracranial, intraorbital, intracapsular, 
intraspinal, intracisternal, intrapulmonary, oral, intravenous, intramuscular, intra-arterial, 
intramedullary, intrathecal, intraventricular, transdermal, subcutaneous, intraperitoneal, intranasal, 
enteral, topical, sublingual, or rectal means. In addition to the active ingredients, these 

30 pharmaceutical compositions may contain suitable physiologically acceptable carriers comprising 
excipients and auxiliaries which facilitate processing of the active compounds into preparations 
which can be used pharmaceutical^. Further details on techniques for formulation and 
administration may be found in the latest edition of Remington's Pharmaceutical Sciences (Maack 
PublishingCo. Easton, Pa). 

35 Pharmaceutical compositions for oral administration can be formulated using 

physiologically acceptable carriers well known in the art in dosages suitable for oral 
administration. Such carriers enable the pharmaceutical compositions to be formulated as 
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powders, tablets, pills, dragees, capsules, liquids, gels, syrups, slurries, suspensions, and the like, 

for ingestion by the patient. 

Pharmaceutical preparations for oral use can be obtained through combining active 

compounds with solid excipient, optionally grinding the resulting mixture,, and processing the 
5 mixture of granules, after adding suitable auxiliaries, if desired, to obtain tablets or dragee cores. 

Suitable excipients are carbohydrate or protein fillers, such as sugars, including lactose, sucrose, 

mannitol, or sorbitol; starch from corn, wheat, rice, potato, or other plants; cellulose, such as 

methyl cellulose, hydroxypropylmethyl-cellulose, or sodium carboxymethylcellulose; gums 

including arabic and tragacanth; and proteins such as gelatin and collagen. If desired, 
10 disintegrating or solubilizing agents may be added, such as the cross-linked polyvinyl pyrrolidone, 

agar, alginic acid, or a salt thereof, such as sodium alginate. 

Dragee cores may be used in conjunction with suitable coatings, such as concentrated 

sugar solutions, which may also contain gum arabic, talc, polyvinylpyrrolidone, carbopol gel, 

polyethylene glycol, and/or titaniumdioxide, lacquer solutions, and suitable organic solvents or 
15 solvent mixtures. Dyestuffs or pigments may be added to the tablets or dragee coatings for product 

identification or to characterize the quantity of active compound, i.e., dosage. 

Pharmaceutical preparations which can be used orally include push-fit capsules made of 

gelatin, as well as soft, sealed capsules made of gelatin and a coating, such as glycerol or sorbitol. 

Push-fit capsules can contain active ingredients mixed with a filler or binders, such as lactose or 
20 starches, lubricants, such as talc or magnesium stearate, and, optionally, stabilizers. In soft 

capsules, the active compounds may be dissolved or suspended in suitable liquids, such- as fatty 

oils, liquid, or liquidpolyethylene glycol with or without stabilizers. 

Formulations suitable for pulmonary or respiratory delivery include dry powders, liquid 

solutions or suspensions suitable for nebulization, and propellant formulations suitable for use in 
25 metered dose inhalers (MDrs). The preparation of such formulations is well described in the 

patent, scientific, and medical literatures, and the following descriptions are intended to be 

exemplary only. 

Dry powder formulations will have a particle size within a preferred range for deposition 
within the alveolar region of the lung, typically from 0.5 .mu.m to 5 .mu.m. Respirable powders of 

30 pharmaceutical compositions within the preferred size range can be produced by a variety of 
conventional techniques, such as jet-milling, spray-drying, solvent precipitation, and the like. Dry 
powders can then be administered to the patient in conventional dry powder inhalers (DPI's) that 
use the patient's inspiratory breath through the device to disperse the powder or in air-assisted 
devices that use an external power source to disperse the powder into an aerosol cloud, as 

35 described in U.S. Pat No. 5,458,135, the full disclosure of which is incorporated herein by 
reference. 

Dry powder devices typically require a powder mass in the range from about 1 mg to 10 
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mg to produce a single aerosolized dose, which may necessitate addition of a dry bulking powder 
to the pharmaceutical formulation. Preferred dry bulking powders include sucrose, lactose, 
trehalose, human serum albumin (HSA), and glycine. Other suitable dry bulking powders include 
cellobiose, dextrans, maltotriose, pectin, sodium citrate, sodium ascorbate, mannitol, and the like. 
5 Furthermore, stabilizing buffers and salts may be used. Other additives, such as chelating agents, 
peptidase inhibitors, and the like, which would facilitate the biological activity of the 
pharmaceutical composition once it is dissolved within the lung would be appropriate. For 
example, ethylenediaminetetraacetic acid (ETD A) would be useful as a chelator for divalent 
cations which are peptidase cofactors. 

10 Liquid formulations for use in nebulizer systems preferably employ slightly acidic buffers 

(pH 4-6) such as acetate, ascorbate, and citrate, at concentrations of 5 mM to 50 mM. These 
buffers can act as antioxidants. Physiologically acceptable components to enhance or maintain 
chemical stability include: antioxidants, chelating agents, protease inhibitors, isotonic modifiers, 
inert gases, and the like. A preferred type of nebulizer suitable for delivering such liquid 

15 formulations is described in U.S. Pat. No. 5,458,135, the disclosure of which is incorporated herein 
by reference. 

For use in MDrs, the pharmaceutical composition will be dissolved or suspended in a 
suitable aerosol propellant, such as a chlorofluorocarbon (CFC) or a hydrofluorocarbon (HFC). 
Suitable CFCs include trichloromonofluoromethane (propellant 1 1), dichlorotetrafluoromethane 

20 (propellant 1 14), and dichlorodifluoromethane (propellant 12). Suitable HFC ! s include 
tetrafluoroethane (HFC- 1 34a) and heptafluoropropane (HFC-227). 

Preferably, for incorporation into the aerosol propellant, the pharmaceutical composition 
will be processed into respirable particles as described for the dry powder formulations. The 
particles are then suspended in the propellant, typically being coated with a surfactant to enhance 

25 their dispersion. Suitable surfactants include oleic acid, sorbitan trioleate, and various long chain 
, diglycerides and phospholipids. Such aerosol propellant formulations may further include a lower 
alcohol, such as ethanol (up to 30% by weight) and other additives to maintain or enhance 
chemical stability and physiological acceptability (U.S. Patent 6,080,721, which disclosure is 
hereby incorporated by reference in its entirety). 

30 For topical or nasal administration, penetrants appropriate to the particular barrier to be 

permeated are used in the formulation. Such penetrants are generally known in the art. 

Pharmaceutical formulations suitable for parenteral administration may be formulated in 
aqueous solutions, preferably in physiologically compatible buffers such as Hanks solution, 
Ringer's solution, or physiologically buffered saline. Aqueous injection suspensions may contain 

3 5 substances which increase the viscosity of the suspension, such as sodium carboxymethylcellulose, 
sorbitol, or dextran. Additionally, suspensions of the active compounds may be prepared as 
appropriate oily injection suspensions. Suitable lipophilic solvents or vehicles include fatty oils 
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such as sesame oil, or synthetic fatty acid esters, such as ethyl oleate or triglycerides, or liposomes. 
Optionally, the suspension may also contain suitable stabilizers or agents which increase the 
solubility of the compounds to allow for the preparation of highly concentrated solutions. 

The pharmaceutical compositions of the present invention may be manufactured in a 
5 manner that is known in the art, e.g., by means of conventional mixing, dissolving, granulating, 
dragee-making, levigating, emulsifying, encapsulating, entrapping, or lyophilizing processes. 

The pharmaceutical composition may be provided as a salt and can be formed with many 
acids, including but not limited to, hydrochloric, sulfuric, acetic, lactic, tartaric, malic, succinic, 
etc. Salts tend to be more soluble in aqueous or other protonic solvents than are the corresponding 
10 free base forms. In other cases, the preferred preparation may be a lyophilized powder which may 
contain any or all of the following: 1-50 mM histidine, 0.1%-2% sucrose, and 2-7% mannitol, at a 
pH range of 4.5 to 5.5, that is combined with buffer prior to use. 

After pharmaceutical compositions have been prepared, they can be placed in an 
appropriate container and labeled for treatment of an indicated condition. For administration of a 
15 GENSET polypeptide, such labeling would include amount, frequency, and method of 
administration. 

Pharmaceutical compositions suitable for use in the invention include compositions wherein the 
active ingredients are contained in an effective amount to achieve the intended purpose. The 
determination of an effective dose is well within the capability of those skilled in the art. 

20 For any compound, the therapeutically effective dose can be estimated initially either in 

cell culture assays, e.g., of neoplastic cells, or in animal models, usually mice, rabbits, dogs, or 
pigs. The animal model may also be used to determine the appropriate concentration range and 
route of administration. Such information can then be used to determine useful doses and routes 
for administration in humans. 
.25 A therapeutically effective dose refers to that amount of active ingredient, for example a 

GENSET polypeptide or fragments thereof, antibodies specific to GENSET polypeptides, agonists, 
antagonists or inhibitors of GENSET polypeptides, which ameliorates the symptoms or condition. 
Therapeutic efficacy and toxicity may be determined by standard pharmaceutical procedures in cell 
cultures or experimental animals, e.g., ED50 (the dose therapeutically effective in 50% of the 

30 population) and LD50 (the dose lethal to 50% of the population). The dose ratio between 
therapeutic and toxic effects is the therapeutic index, and it can be expressed as the ratio, 
LD50/ED50. Pharmaceutical compositions which exhibit large therapeutic indices are preferred. 
The data obtained from cell culture assays and animal studies is used in formulating a range of 
dosage for human use. The dosage contained in such compositions is preferably within a range of 

35 circulating concentrations that include the ED50 with little or no toxicity. The dosage varies 
within this range depending upon the dosage form employed, sensitivity of the patient, and the 
route of administration. 
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The exact dosage will be determined by the practitioner, in light of factors related to the 
subject that requires treatment. Dosage and administration are adjusted to provide sufficient levels 
of the active moiety or to maintain the desired effect. Factors which may be taken into account 
include the severity of the disease state, general health of the subject, age, weight, and gender of 
5 the subject, diet, time and frequency of administration, drug combination(s), reaction sensitivities, 
and tolerance/response to therapy. Other factors that may be considered when evaluating the 
proper dosage include the chemical nature of the compound destined for delivery, the biological 
responses associated with the compound (both intended and coincidental) and anticipated 
contraindications. Additionally, the mode of delivery (including but not limited to systemic and/or 

10 local applications: oral, oral enteric, intramuscular injection, subcutaneous injection, intradermal 
injection, interarticular space, intravascular injection, intravenous infusion, suppository, topical 
preparation, transdermal system), the duration and frequency of administration (e.g. n doses per 
hours, n doses per day, n doses per week, cumulative dosage per day, cumulative dosage per 
week), the biologically effective dose delivered to target site, often indicated by plasma level 

15 concentrations, and the rate or efficiency of compound clearance from the body may be 

considered. Long-acting pharmaceutical compositions maybe administered every 3 to 4 days, 
every week, or once every two weeks depending on half-life and clearance rate of the particular 
formulation. 

Normal dosage amounts may vary depending upon the route of administration. Guidance 
20 as to particular dosages and methods of delivery is provided in the literature and generally 

available to practitioners in the art. Those skilled in the art will employ different formulations for 
nucleotides than for proteins or their inhibitors. Similarly, delivery of polynucleotides or 
polypeptides will be specific to particular cells, conditions, locations, etc. In general, for a 75kg 
individual the normal dosage range are as follows: for a small molecule compound an effective 
25 does is usually between 0.3-50 mg/kg; for recombinant polypeptides an effective dose is usually 
between 0.25-7.5 mg/kg; for compounds used for mediating humoral immune responses (e.g., 
polyvalent pneumococcal vaccine, Rho (D) immune globulin, Hepatitis B vaccine, anti-CD20 
antigen) the effective dose is usually between 0.0015-1.5 mg/kg; for hormone supplemental 
compounds (e.g. estradiol, norethindrone) the effective dose is usually between 0.0005-0.5 mg/kg 
30 depending upon delivery system utilized (e.g. transdermal, oral, topical). 

Transdermal delivery systems (e.g. estradiol transdermal system, transdermal scopolamine 
system, transfermal nicotine patch) must be calibrated for nominal delivery dosages based upon 
efficiency of percutaneous delivery for the individual and specific compounds, surface area (cm 2 ) 
35 of transdermal system contact, quantity and form of compound integrated into transdermal delivery 
system and anatomical location of positioned transdermal system. The effective dosage range of 
compounds admistered in this manner is usually between 0.005-0.5 mg/kg 
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Uses of Genset Sequences: Computer-Related Embodiments 
As used herein the term "GENSET cDNAs" encompasses the nucleotide sequences of the 
present invention. 

5 It will be appreciated by those skilled in the art that the nucleic acid codes of the invention 

and polypeptide codes of the invention can be stored, recorded, and manipulated on any medium 
which can be read and accessed by a computer. As used herein, the words "recorded" and "stored" 
refer to a process for storing information on a computer medium. A skilled artisan can readily 
adopt any of the presently known methods for recording information on a computer readable 

10 medium to generate manufactures comprising one or more of the nucleic acid codes of the 

invention, or one or more of the polypeptide codes of the invention. Another aspect of the present 
invention is a computer readable medium having recorded thereon at least 1, 2, 5, 10, 15, 20, 25, 
30, or 50 nucleic acid codes of the invention. Another aspect of the present invention is a 
computer readable medium having recorded thereon at least 1, 2, 5, 10, 15, 20,25, 30, or 50 

1 5 polypeptide codes of the invention. 

Computer readable media include magnetically readable media, optically readable media, 
electronically readable media and magnetic/optical media. For example, the computer readable 
media may be a hard disk, a floppy disk, a magnetic tape, CD-ROM, Digital Versatile Disk 
(DVD), Random Access Memory (RAM), or Read Only Memory (ROM) as well as other types of 

20 other media known to those skilled in the art. 

Embodiments of the present invention include systems, particularly computer systems 
which store and manipulate the sequence information described herein. One example of a 
computer system 100 is illustrated in block diagram form in Figure 1. As used herein, "a computer 
system" refers to the hardware components, software components, and data storage components 

25 used to analyze the nucleotide sequences of the nucleic acid codes of the invention or the amino 
acid sequences of the polypeptide codes of the invention. In one embodiment, the computer 
system 100 is a Sun Enterprise 1000 server (Sun Microsystems, Palo Alto, CA). The computer 
system 100 preferably includes a processor for processing, accessing and manipulating the 
sequence data. The processor 105 can be any well-known type of central processing unit, such as 

30 the Pentium HI from Intel Corporation, or similar processor from Sun, Motorola, Compaq or 
International Business Machines. 

Preferably, the computer system 100 is a general purpose system that comprises the 
processor 105 and one or more internal data storage components 1 10 for storing data, and one or 
more data retrieving devices for retrieving the data stored on the data storage components. A 

35 skilled artisan can readily appreciate that any one of the currently available computer systems are 
suitable. 

In one particular embodiment, the computer system 100 includes a processor 105 
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connected to a bus which is connected to a main memory 1 15 (preferably implemented as RAM) 
and one or more internal data storage devices 110, such as a hard drive and/or other computer 
readable media having data recorded thereon. In some embodiments, the computer system 100 
further includes one or more data retrieving device 1 18 for reading the data stored on the internal 
5 data storage devices 1 10. 

The data retrieving device 118 may represent, for example, a floppy disk drive, a compact 
disk drive, a magnetic tape drive, etc. In some embodiments, the internal data storage device 110 
is a removable computer readable medium such as a floppy disk, a compact disk, a magnetic tape, 
etc. containing control logic and/or data recorded thereon. The computer system 100 may 

10 advantageously include or be programmed by appropriate software for reading the control logic 
and/or the data from the data storage component once inserted in the data retrieving device. 

The computer system 100 includes a display 120 which is used to display output to a 
computer user. It should also be noted that the computer system 100 can be linked to other 
computer systems 125a-c in a network or wide area network to provide centralized access to the 

1 5 computer system 1 00. 

Software for accessing and processing the nucleotide sequences of the nucleic acid codes of the 
invention or the amino acid sequences of the polypeptide codes of the invention (such as search 
tools, compare tools, and modeling tools etc.) may reside in main memory 1 15 during execution. 
In some embodiments, the computer system 100 may further comprise a sequence 

20 comparer for comparing the above-described nucleic acid codes of the invention or the polypeptide 
codes of the invention stored on a computer readable medium to reference nucleotide or 
polypeptide sequences stored on a computer readable medium. A "sequence comparer" refers to 
one or more programs which are implemented on the computer system 100 to compare a 
nucleotide or polypeptide sequence with other nucleotide or polypeptide sequences and/or 

25 compounds including but not limited to peptides, peptidomimetics, and chemicals stored within the 
data storage means. For example, the sequence comparer may compare the nucleotide sequences 
of nucleic acid codes of the invention or the amino acid sequences of the polypeptide codes of the 
invention stored on a computer readable medium to reference sequences stored on a computer 
readable medium to identify homologies, motifs implicated in biological function, or structural 

30 motifs. The various sequence comparer programs identified elsewhere in this patent specification 
are particularly contemplated for use in this aspect of the invention. 

Figure 2 is a flow diagram illustrating one embodiment of a process 200 for comparing a 
new nucleotide or protein sequence with a database of sequences in order to determine the 
homology levels between the new sequence and the sequences in the database. The database of 

35 sequences can be a private database stored within the computer system 100, or a public database 
such as GENBANK, PIR OR SWISSPROT that is available through the Internet. 

The process 200 begins at a start state 201 and then moves to a state 202 wherein the new 
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sequence to be compared is stored to a memory in a computer system 100. As discussed above, 
the memory could be any type of memory, including RAM or an internal storage device. 

The process 200 then moves to a state 204 wherein a database of sequences is opened for 
analysis and comparison. The process 200 then moves to a state 206 wherein the first sequence 
5 stored in the database is read into a memory on the computer. A comparison is then performed at a 
state 210 to determine if the first sequence is the same as the second sequence. It is important to 
note that this step is not limited to performing an exact comparison between the new sequence and 
the first sequence in the database. Well-known methods are known to those of skill in the art for 
comparing two nucleotide or protein sequences, even if they are not identical. For example, gaps 

10 can be introduced into one sequence in order to raise the homology level between the two tested 
sequences. The parameters that control whether gaps or other features are introduced into a 
sequence during comparison are normally entered by the user of the computer system. 

Once a comparison of the two sequences has been performed at the state 210, a 
determination is made at a decision state 210 whether the two sequences are the same. Of course, 

15 the term "same" is not limited to sequences that are absolutely identical. Sequences that are within 
the homology parameters entered by the user will be marked as "same" in the process 200. 

If a determination is made that the two sequences are the same, the process 200 moves to a 
state 214 wherein the name of the sequence from the database is displayed to the user. This state 
notifies the user that the sequence with the displayed name fulfills the homology constraints that 

20 were entered. Once the name of the stored sequence is displayed to the user, the process 200 : 
moves to a decision state 218 wherein a determination is made whether more sequences exist in the 
database. If no more sequences exist in the database, then the process 200 terminates at an end 
state 220. However, if more sequences do exist in the database, then the process 200 moves to a 
state 224 wherein a pointer is moved to the next sequence in the database so that it can be 

25 compared to the new sequence. In this manner, the new sequence is aligned and compared with 
every sequence in the database. 

It should be noted that if a determination had been made at the decision state 212 that the 
sequences were not homologous, then the process 200 would move immediately to the decision 
• state 218 in order to determine if any other sequences were available in the database for 

30 comparison. 

Accordingly, one aspect of the present invention is a computer system comprising a 
processor, a data storage device having stored thereon a nucleic acid code of the invention or a 
polypeptide code of the invention, a data storage device having retrievably stored thereon reference 
nucleotide sequences or polypeptide sequences to be compared to the nucleic acid code of the 
35 invention or polypeptide code of the invention and a sequence comparer for conducting the 
comparison. The sequence comparer may indicate a homology level between the sequences 
compared or identify motifs implicated in biological function and structural motifs in the nucleic 
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acid code of the invention and polypeptide codes of the invention or it may identify structural 
motifs in sequences which are compared to these nucleic acid codes and polypeptide codes. In 
some embodiments, the data storage device may have stored thereon the sequences of at least 2, 5, 
10, 15, 20, 25, 30, or 50 of the nucleic acid codes of the invention or polypeptide codes of the 
5 invention. 

Another aspect of the present invention is a method for determining the level of homology 
between a nucleic acid code of the invention and a reference nucleotide sequence, comprising the 
steps of reading the nucleic acid code and the reference nucleotide sequence through the use of a 
computer program which determines homology levels and determining homology between the 

10 nucleic acid code and the reference nucleotide sequence with the computer program. The 
computer program may be any of a number of computer programs for determining homology 
levels, including those specifically enumerated herein, including BLAST2N with the default 
parameters or with any modified parameters. The method may be implemented using the computer 
systems described above. The method may also be performed by reading 2, 5, 10, 15, 20, 25, 30, 

15 or 50 of the above described nucleic acid codes of the invention through the use of the computer 
program and determining homology between the nucleic acid codes and reference nucleotide 
sequences. 

Figure 3 is a flow diagram illustrating one embodiment of a process 250 in a computer for 
determining whether two sequences are homologous. The process 250 begins at a start state 252 

20 and then moves to a state 254 wherein a first sequence to be compared is stored to a memory. The 
second sequence to be compared is then stored to a memory at a state 256. The process 250 then 
moves to a state 260 wherein the first character in the first sequence is read and then to a state 262 
wherein the first character of the second sequence is read. It should be understood that if the 
sequence is a nucleotide sequence, then the character would normally be either A, T, C, G or U. If 

25 the sequence is a protein sequence, then it should be in the single letter amino acid code so that the 
first and sequence sequences can be easily compared. 

A determination is then made at a decision state 264 whether the two characters are the 
same. If they are the same, then the process 250 moves to a state 268 wherein the next characters 
in the first and second sequences are read. A determination is then made whether the next 

30 characters are the same. If they are, then the process 250 continues this loop until two characters 
are not the same. If a determination is made that the next two characters are not the same, the 
process 250 moves to a decision state 274 to determine whether there are any more characters 
either sequence to read. 

If there are no more characters to read, then the process 250 moves to a state 276 wherein 

35 the level of homology between the first and second sequences is displayed to the user. The level of 
homology is determined by calculating the proportion of characters between the sequences that 
were the same out of the total number of sequences in the first sequence. Thus, if every character 
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in a first 100 nucleotide sequence aligned with a every character in a second sequence, the 
homology level would be 100%. 

Alternatively, the computer program may be a computer program which compares the 
nucleotide sequences of the nucleic acid codes of the present invention, to reference nucleotide 
5 sequences in order to determine whether the nucleic acid code of the invention differs from a 
reference nucleic acid sequence at one or more positions. Optionally such a program records the 
length and identity of inserted, deleted or substituted nucleotides with respect to the sequence of 
either the reference polynucleotide or the nucleic acid code of the invention. In one embodiment, 
the computer program may be a program which determines whether the nucleotide sequences of 

10 the nucleic acid codes of the invention contain one or more single nucleotide polymorphisms 
(SNP) with respect to a reference nucleotide sequence. These single nucleotide polymorphisms 
. may each comprise a single base substitution, insertion, or deletion. 

Another aspect of the present invention is a method for determining the level of homology 
between a polypeptide code of the invention and a reference polypeptide sequence, comprising the 

1 5 steps of reading the polypeptide code of the invention and the reference polypeptide sequence 
through use of a computer program which determines homology levels and determining homology 
between the polypeptide code and the reference polypeptide sequence using the computer program. 

Accordingly, another aspect of the present invention is a method for determining whether a 
nucleic acid code of the invention differs at one or more nucleotides from a reference nucleotide 

20 sequence comprising the steps of reading the nucleic acid code and the reference nucleotide 
sequence through use of a computer program which identifies differences between nucleic acid 
sequences and identifying differences between the nucleic acid code and the reference nucleotide 
sequence with the computer program. In some embodiments, the computer program is a program 
which identifies single nucleotide polymorphisms. The method may be implemented by the 

25 computer systems described above and the method illustrated in Figure 3 . The method may also be 
performed by reading at least 2, 5, 10, 15, 20, 25, 30, or 50 of the nucleic acid codes of the 
invention and the reference nucleotide sequences through the use of the computer program and 
identifying differences between the nucleic acid codes and the reference nucleotide sequences with 
the computer program. 

30 In other embodiments the computer based system may further comprise an identifier for 

identifying features within the nucleotide sequences of the nucleic acid codes of the invention or 
the amino acid sequences of the polypeptide codes of the invention. An "identifier" refers to one 
or more programs which identifies certain features within the above-described nucleotide 
sequences of the nucleic acid codes of the invention or the amino acid sequences of the 

35 polypeptide codes of the invention. In one embodiment, the identifier may comprise a program 
which identifies an open reading frame in the cDNAs codes of the invention. 

Figure 4 is a flow diagram illustrating one embodiment of an identifier process 300 for 
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detecting the presence of a feature in a sequence. The process 300 begins at a start state 302 and 
then moves to a state 304 wherein a first sequence that is to be checked for features is stored to a 
memory 1 15 in the computer system 100. The process 300 then moves to a state 306 wherein a 
database of sequence features is opened. Such a database would include a list of each feature's 
5 attributes along with the name of the feature. For example, a feature name could be "Initiation 
Codon" and the attribute would be "ATG" Another example would be the feature name 
"TAATAA Box" and the feature attribute would be "TAATAA". An example of such a database 
is produced by the University of Wisconsin Genetics Computer Group (www.gcg.com). 

Once the database of features is opened at the state 306, the process 300 moves to a state 

10 308 wherein the first feature is read from the database. A comparison of the attribute of the first 
feature with the first sequence is then made at a state 3 1 0. A determination is then made at a 
decision state 316 whether the attribute of the feature was found in the first sequence. If the 
attribute was found, then the process 300 moves to a state 318 wherein the name of the found 
feature is displayed to the user. 

15 The process 300 then moves to a decision state 320 wherein a determination is made 

whether move features exist in the database. If no more features do exist, then the process 300 
terminates at an end state 324. However, if more features do exist in the database, then the process 
300 reads the next sequence feature at a state 326 and loops back to the state 310 wherein the 
attribute of the next feature is compared against the first sequence. 

20 It should be noted, that if the feature attribute is not found in the first sequence at the 

decision state 316, the process 300 moves direcdy to the decision state 320 in order to determine if 
any more features exist in the database. 

In another embodiment, the identifier may comprise a molecular modeling program which 
determines the 3-dimensional structure of the polypeptides codes of the invention. Such programs 

25 may use any methods known to those skilled in the art including methods based on homology- 
modeling, fold recognition and ab initio methods as described in Sternberg et al, (1999) Curr Opin 
Struct Biol. 9(3):368-73, which disclosure is hereby incorporated by reference in its entirety. In 
some embodiments, the molecular modeling program identifies target sequences that are most 
compatible with profiles representing the structural environments of the residues in known three- 

30 dimensional protein structures. (See, e.g., Eisenberg et al 9 U.S. Patent No. 5,436,850 issued July 
25, 1995, which disclosure is hereby incorporated by reference in its entirety). In another 
technique, the known three-dimensional structures of proteins in a given family are superimposed 
to define the structurally conserved regions in that family. This protein modeling technique also 
uses the known three-dimensional structure of a homologous protein to approximate the structure 

35 of the polypeptide codes of the invention. (See e.g., Srinivasan, et a/., U.S. Patent No. 5,557,535 
issued September 17, 1996, which disclosure is hereby incorporated by reference in its entirety). 
Conventional homology modeling techniques have been used routinely to build models of 
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proteases and antibodies. [Sowdhamini et al (1997), Protein Engineering 10:207, 215]. 
Comparative approaches can also be used to develop three-dimensional protein models when the 
protein of interest has poor sequence identity to template proteins, hi some cases, proteins fold 
into similar three-dimensional structures despite having very weak sequence identities. For 

5 example, the three-dimensional structures of a number of helical cytokines fold in similar three- 
dimensional topology in spite of weak sequence homology. 

The recent development of threading methods now enables the identification of likely 
folding patterns in a number of situations where the structural relatedness between target and 
template(s) is not detectable at the sequence level. Hybrid methods, in which fold recognition is 

10 performed using Multiple Sequence Threading (MST), structural equivalencies are deduced from 
the threading output using a distance geometry program DRAGON to construct a low resolution 
model, and a full-atom representation is constructed using a molecular modeling package such as 
QUANTA. 

According to this 3-step approach, candidate templates are first identified by using the 

15 novel fold recognition algorithm MST, which is capable of performing simultaneous threading of 
multiple aligned sequences onto one or more 3-D structures. In a second step, the structural 
equivalencies obtained from the MST output are converted into interresidue distance restraints and 
fed into the distance geometry program DRAGON, together with auxiliary information obtained 
from secondary structure predictions. The program combines the restraints in an unbiased manner 

20 and rapidly generates a large number of low resolution model confirmations. In a third step, these 
low resolution model confirmations are converted into Ml-atom models and subjected to energy 
minimization using the molecular modeling package QUANTA. (See e.g., Asz6di et aL, (1997) 
Proteins: Structure, Function, and Genetics, Supplement 1:38-42). 

The results of the molecular modeling analysis may then.be used in rational drug design 

25 techniques to identify agents which modulate the activity of the polypeptide codes of the invention. 
Accordingly, another aspect of the present invention is a method of identifying a feature 
within the nucleic acid codes of the invention or the polypeptide codes of the invention comprising 
reading the nucleic acid code(s) or the polypeptide code(s) through the use of a computer program 
'which identifies features therein and identifying features within the nucleic acid code(s) or 

30 polypeptide code(s) with the computer program. In one embodiment, computer program comprises 
a computer program which identifies open reading frames, hi a further embodiment, the computer 
program identifies linear or structural motifs in a polypeptide sequence. In another embodiment, 
the computer program comprises a molecular modeling program. The method may be performed 
by reading a single sequence or at least 2, 5, 10, 15, 20, 25, 30, or 50 of the nucleic acid codes of 

35 the invention or the polypeptide codes of the invention through the use of the computer program 
and identifying features within the nucleic acid codes or polypeptide codes with the computer 
program. 
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The nucleic acid codes of the invention or the polypeptide codes of the invention may he 
stored and manipulated in a variety of data processor programs in a variety of formats. For 
example, they may be stored as text in a word processing file, such as MicrosoftWORD or 
WORDPERFECT or as an ASCII file in a variety of database programs familiar to those of skill in 
5 the art, such as DB2, SYBASE, or ORACLE. Li addition, many computer programs and databases 
may be used as sequence comparers, identifiers, or sources of reference nucleotide or polypeptide 
sequences to be compared to the nucleic acid codes of the invention or the polypeptide codes of the 
invention. The following list is intended not to limit the invention but to provide guidance to 
programs and databases which are useful with the nucleic acid codes of the invention or the 

10 polypeptide codes of the invention. The programs and databases which may be used include, but 
are not limited to: MacPattern (EMBL), DiscoveryBase (Molecular Applications Group), 
GeneMine (Molecular Applications Group), Look (Molecular Applications Group), MacLook 
(Molecular Applications Group), BLAST and BLAST2 (NCBI), BLASTN and BLASTX (Altschul 
et al, 1990), FASTA (Pearson and Lipman, 1988), FASTDB (Brutlag et al., 1990), Catalyst 

15 (Molecular Simulations Lie), Catalyst/SHAPE (Molecular Simulations Inc.), Cerius2.DBAccess 
(Molecular Simulations Inc.), HypoGen (Molecular Simulations Lie), Insight n, (Molecular 
Simulations Inc.), Discover (Molecular Simulations Lie), CHARMm (Molecular Simulations 
Lie), Felix (Molecular Simulations Lie), DelPhi, (Molecular Simulations Lie), QuanteMM, 
(Molecular Simulations Lie), Homology (Molecular Simulations Lie), Modeler (Molecular 

20 Simulations Inc.), ISIS (Molecular Simulations Inc.), Quanta/Protein Design (Molecular 

Simulations Lie), WebLab (Molecular Simulations Lie), WebLab Diversity Explorer (Molecular 
Simulations Lie), Gene Explorer (Molecular Simulations Lie), SeqFold (Molecular Simulations 
Lie), the EMBL/Swissprotein database, the MDL Available Chemicals Directory database, the 
MDL Drug Data Report data base, the Comprehensive Medicinal Chemistry database, Derwents's 

25 World Drug Index database, the BioByteMasterFile database, the Genbank database, and the 
Genseqn database. Many other programs and data bases would be apparent to one of skill in the 
art given the present disclosure. 

Motifs which may be detected using the above programs include sequences encoding 
leucine zippers, helix-turn-helix motifs, glycosylation sites, ubiquitination sites, alpha helices, and 

30 beta sheets, signal sequences encoding signal peptides which direct the secretion of the encoded 
, proteins, sequences implicated in transcription regulation such as homeoboxes, acidic stretches, 
enzymatic active sites, substrate binding sites, and enzymatic cleavage sites. 



Conclusion 

35 As discussed above, the GENSET polynucleotides and polypeptides of the present 

invention or fragments thereof can be used for various purposes. The polynucleotides can be used 
to express recombinant protein for analysis, characterization or therapeutic use; as markers for 
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tissues in which the corresponding protein is preferentially expressed (either constitutively or at a 
particular stage of tissue differentiation or development or in disease states); as molecular weight 
markers on Southern gels; as chromosome markers or tags (when labeled) to identify chromosomes 
or to map related gene positions; as a reagent (including a labeled reagent) in assays designed to 
5 quantitatively determine levels of GENSET expression in biological samples; to compare with 
endogenous DNA sequences in patients to identify potential genetic disorders; as probes to 
hybridize and thus discover novel, related DNA sequences; as a source of information to derive 
PCR primers for genetic fingerprinting; for selecting and making oligomers for attachment to a 
"gene chip" or other support, including for examination for expression patterns; to raise anti- 

10 protein antibodies using DNA immunization techniques; and as an antigen to raise anti-DNA 
antibodies or elicit another immune response. Where the polynucleotide encodes a protein which 
binds or potentially binds to another protein (such as, for example, in a receptor-ligand interaction), 
the polynucleotide can also be used in interaction trap assays (such as, for example, that described 
in Gyuris et al y (1993) Cell 75:791-803 to identify polynucleotides encoding the other protein with 

1 5. which binding occurs or to identify inhibitors of the binding interaction. 

The proteins or polypeptides provided by the present invention can similarly be used in 
assays to determine biological activity, including in a panel of multiple proteins for high- 
throughput screening; to raise antibodies or to elicit another immune response; as a reagent 
(including the labeled reagent) in assays designed to quantitatively determine levels of the protein 

20 (or its receptor) in biological fluids; as markers for tissues in which the corresponding protein is 
preferentially expressed (either constitutively or at a particular stage of tissue differentiation or 
development or in a disease state); and, of course, to isolate correlative receptors or ligands. 
Where the protein binds or potentially binds to another protein (such as, for example, in a receptor- 
ligand interaction), the protein can be used to identify the other protein with which binding occurs 

25 or to identify inhibitors of the binding interaction. Proteins involved in these binding interactions 
can also be used to screen for peptide or small molecule inhibitors or agonists of the binding 
interaction. 

Any or all of these research utilities are capable of being developed into reagent grade or 
kit format for commercialization as research products. 

30 Methods for performing the uses listed above are well known to those skilled in the art. 

References disclosing such methods include without limitation 'Molecular Cloning; A Laboratory 
Manual", 2d ed., Cole Spring Harbor Laboratory Press, Sambrook, J., E.F. Fritsch and T. Maniatis 
eds., 1989, and "Methods in Enzymology; Guide to Molecular Cloning Techniques", Academic 
Press, Berger and Kimmel eds., 1987, which disclosures are hereby incorporated by reference in 

35 their entireties. 

Although this invention has been described in terms of certain preferred embodiments, 
other embodiments which will be apparent to those of ordinary skill in the art in view of the 
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disclosure herein are also within the scope of this invention. Accordingly, the scope of the 
invention is intended to be defined only by reference to the appended claims. 

EXAMPLES 

5 Example 1: Preparation of Antibody Compositions to the GENSET protein 

Substantially pure protein or polypeptide is isolated from transfected or transformed cells 
containing an expression vector encoding the GENSET protein or a portion thereof. The 
concentration of protein in the final preparation is adjusted, for example, by concentration on an 
Amicon filter device, to the level of a few micrograms/ml. Monoclonal or polyclonal antibody to the 
1 0 protein can then be prepared as follows: 

A. Monoclonal Antibody Production bv Hvbridoma Fusion 

Monoclonal antibody to epitopes in the GENSET protein or a portion thereof can be 
prepared from murine hybridomas according to the classical method of Kohler and Milstein, 
• (1975) Nature 256:495 or derivative methods thereof. Also see Harlow and Lane. (1988). 

1 5 Briefly, a mouse is repetitively inoculated with a few micrograms of the GENSET protein, 

or a portion thereof, over a period of a few weeks. The mouse is then sacrificed, and the antibody 
producing cells of the spleen isolated. The spleen cells are fused by means of polyethylene glycol 
with mouse myeloma cells, and the excess unfused cells destroyed by growth of the system on 
selective media comprising aminopterin (HAT media). The successfully fused cells are diluted 

20 and aliquots of the dilution pkced in wells of a microliter plate where growth of the culture is 
continued. Antibody-producing clones are identified by detection of antibody in the supernatant 
fluid of the wells by immunoassay procedures, such as ELISA, as originally described by Engvall, 
(1980) Meth. Enzymol. 70:419, which disclosure is hereby incorporated by reference in its 
entirety, and derivative methods thereof. Selected positive clones can be expanded and their 

25 monoclonal antibody product harvested for use. Detailed procedures for monoclonal antibody 
production are described in Davis, et al (1986) Section 21-2. 

B. Polyclonal Antibody Production bv Immunization 

Polyclonal antiserum containing antibodies to heterogeneous epitopes in the GENSET 
protein or a portion thereof can be prepared by immunizing suitable non-human animal with the 

30 GENSET protein or a portion thereof, which can be unmodified or modified to enhance 

immunogenicity. A suitable non-human animal is preferably a non-human mammal is selected, 
usually a mouse, rat, rabbit, goat, or horse. Alternatively, a crude preparation which has been 
enriched for GENSET concentration can be used to generate antibodies. Such proteins, fragments 
or preparations are introduced into the non-human mammal in the presence of an appropriate 

35 adjuvant (e.g. aluminum hydroxide, RBI, etc.) which is known in the art. In addition the protein, 
fragment or preparation can be pretreated with an agent which will increase antigenicity, such 
agents are known in the art and include, for example, methylated bovine serum albumin (mBSA), 
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bovine serum albumin (BSA), Hepatitis B surface antigen, and keyhole limpet hemocyanin (KLH). 

i Serum from the immunized animal is collected, treated and tested according to known procedures. 
If the serum contains polyclonal antibodies to undesired epitopes, the polyclonal antibodies can be 
purified by immunoaffinity chromatography. 
5 Effective polyclonal antibody production is affected by many factors related both to the 

antigen and the host species. Also, host animals vary in response to site of inoculations and dose, 
with both inadequate or excessive doses of antigen resulting in low titer antisera. Small doses (ng 
level) of antigen administered at multiple intradermal sites appears to be most reliable. Techniques 
for producing and processing polyclonal antisera are known in the art. An effective immunization 

10 protocol for rabbits can be found in Vaitukaitis et al 9 (1971) J. Clin. Endocrinol. Metab. 33:988- 
991, which disclosure is hereby incorporated by reference in its entirety. 

Booster injections can be given at regular intervals, and antiserum harvested when 
antibody titer thereof, as determined semi-quantitatively, for example, by double immunodiffusion 
in agar against known concentrations of the antigen, begins to fall. See, for example, Ouchterlony 

15 et aU (1973) Chap. 19 in: Handbook of Experimental Immunology D. Wier (ed) Blackwell, which 
disclosure is hereby incorporated by reference in its entirety. Plateau concentration of antibody is . 
usually in the range of 0.1 to 0.2 mg/ml of serum (about 12 uM). Affinity of the antisera for the 
antigen is determined by preparing competitive binding curves, as described, for example, by 
Fisher, (1980) Chap. 42 in: Manual of Clinical Immunology, 2d Ed. (Rose and Friedman,' Eds.) 

20 Amer. Soc. For Microbiol., Washington, D.C., which disclosure is hereby incorporated by 
reference in its entirety. 

Antibody preparations prepared according to either the monoclonal or the polyclonal 
protocol are useful in quantitative immunoassays which determine concentrations of antigen- 
bearing substances in biological samples; they are also used semi-quantitatively or qualitatively to 

25 identify the presence of antigen in a biological sample. The antibodies may also be used in 
therapeutic compositions for killing cells expressing the protein or reducing the levels of the 
protein in the body. 
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Table I 



SEO 

ro 

NO. 


Sequence 
Type 


Clone ID Clone Name 


Name 


ATCC 
Deposit 


ATCC 
Deposit Date 


1 


DNA 


223583_114-044-2-0-El 1-F 


S-lOOAlOrP 


PTA-2732 


Nov., 27, 
2000 


2 


Protein 


223583_1 14-044-2-0-E1 1-F 


S-lOOAlOrP 


PTA-2732 


Nov, 27, 
2000 


3 


DNA 


1000848582 181-40-4-0- ! 
All-F 


SCPhx 


PTA-2732 


Nov, 27, 
2000 


4 


Protein 


1000848582 181-40-4-0- 
All-F 


SCPhx 


PTA-2732 


Nov, 27, 
2000 


5 


DNA 


1000839315 220-26-1-0-F3- 
F 


Chimerin 


PTA-2732 


Nov, 27, 
2000 


6 


Protein 


1000839315 220-26-1-0-F3- 
F 


Chimerin 


PTA-2732 


Nov, 27, 
2000 


7 


DNA 


1000770704 208-27-3-0-G6- 
F 


CalX 


PTA-2732 


Nov, 27, 
2000 


. 8 


Protein 


1000770704 208-27-3-0-G6- 
F 


CaDC 


PTA-2732 


Nov, 27, 
2000 


9 


DNA 


147103_106-024-l-0-H6-F 


sLRPIO 


PTA-2534 


Sep, 27, 
2000 


10 


Protein 


147103_106-024-1-0-H6-F 


sLRPIO 


PTA-2534 


Sep, 27, 
2000 


11 


DNA 


224168_1 16-096-3-0-G1 1 -F 


sLRPIO 


PTA-2534 


Sep, 27, 
2000 


12 


Protein 


224168_1 16-096-3-0-G1 1-F 


sLRPIO 


PTA-2534 


Sep, 27, 
2000 


13 


DNA 


243303_1 16-11 8-4-0-A3-F 


sLRPIO 


PTA-2534 


Sep, 27, 
2000 


■ 14 


Protein 


243303_1 16-11 8-4-0-A3-F 


sLRPIO 


PTA-2534 


Sep, 27, 
2000 


15 


DNA 


225432_1 16-083-3-0-C6-F 


sLRPIO 


PTA-2534 


Sep, 27, 
2000 


! 16 


Protein 


225432J 16-083-3-0-C6-F 


sLRPIO 


PTA-2534 


Sep., 27, 
2000 


17 


DNA 


229633_1 14-049-1-0-D3-F 


STAMSAP 


PTA-2534 


Sep, 27, 
2000 


18 


Protein 


229633_1 14-049-1 -0-D3-F 


STAMSAP 


PTA-2534 


Sep, 27, 
2000 


19 


DNA 


158523J06-030-2-0-A3-F 


OAR 


PTA-2732 


Nov, 27, 
2000 


20 


Protein 


1 58523_106-030-2-0-A3-F 


OAR 


PTA-2732 


Nov, 27, 
2000 


21 


DNA 


589198_184-ll-l-0-E4-F 


COVI 


PTA-2732 


Nov, 27, 
2000 


22 


Protein 


589198_184-ll-l-0-E4-F 


COVI 


PTA-2732 


Nov, 27, 
2000 


23 


DNA 


47-14-1 -C3-CL0 5 


APIP 


98921 


Oct. 15, 1998 


24 


Protein 


47-14-1 -C3-CL0 5 


APIP 


98921 


98921 


25 


DNA 


545542J82-1-2-0-D12-F 


FGF-22 


PTA-2534 


Sep, 27, 
2000 


26 


Protein 


545542_182-l-2-0-D12-F 


FGF-22 


PTA-2534 


Sep, 27, 
2000 
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77 
Z/ 


TYVT A 

UINA 


1 1 ha n 1 1 ax firi£ /i n di i I? 
1 1 /4Ul_lUO-UUo-4-U-£>l l-r 


Frangiopoge 
n 


JrlA-z534 


Sep., 27, 
2000 


78 
Zo 


iToiein 


1 1 74ft i 1 n A_ftn£_4 n u 1 1 t? 
1 1 /4Ui_lUO-UUO-4-U-£>i l-r 


Frangiopoge 
n 


Tyr»A OCO/I 

ri A-2534 


Sep., 27, 
2000 


7Q 




177471 1 fts_ft07-4 ft m i t? 


Annapoptin 


JrlA-Z534 


Sep., 27, 
2000 


7n 


Protein 


177471 ifK_ftQ7_4 fi r:i i t? 
i 334 j i — iud-wz-4-u-oi l-r 


Armapoptin 


"DTP A 

r 1 A-2534 


Sep., 27, 
2000 


71 

31 


TYW A 


A777ftO 17/1_B 7_A^" 1 1A 17 


rretactilin 


PTA-2534 


Sep., 27, 
2000 


3Z 


Protein 


/1777HQ 17/1 O 7 ft /"MA T7 

4 / / /U9_l /4-o-z-0-d0-r 


rretactilin 


TVT> A oro j1 

PTA-2534 


Sep., 27, 
2000 


77 


TYKT A 
UNA 


1/tCj£ftJC 1ftiC ft7Q O A "DO T? 

1 45 o0o_l 0o-023 -2-0-B3 -r 


MS4A5 


TV-p a ^ r- o A 

PTA-2534 


Sep., 27, 
2000 


34 


Protein 


1 /f-C/TA^ 1 AiC AOI O A "0*5 T? 

1 4 j 6U6_l 06-023-2-0-B3-F 


X /TO A A £ 

MS4A5 


PTA-2534 


Sep., 27, 
2000 


35 


TYVT A 

UNA 


1 AAA*7^flC7C OAO 1 A TIO 

1000769575 208-22- 1-0-B2- 
F 


Antagimn 


PTA-2732 


Nov., 27, 
2000 


7£ 


rrotein 


1000 / 695 /5 208-22-1-0-B2- 
F 


Antaginin 


TVP A ^TTI 

PTA-2732 


Nov., 27, 
2000 


3 I 


LMNA 


1 4t>9 y4__l 0o-0z3 -4-0-C9-r 


Beferin 


TVT» A 

PTA-2732 


Nov., 27, 
2000 


3o 


Protein 


1 /1£QO/t 1ft/: ftTJ A ft Oft T? 

1 4oy !/4_ 1 0O-0Z3 -4-0-Ly-r 


Beferin 


TVT 1 A nnii 

PTA-2732 


Nov., 27, 
2000 


7ft 

3V 


TYM A- 


IftAftOOOTOO *1^Q vl A 17 "7 

1 00083 o7oo_228-28-4-0-r 7- 
r 


RP 


PTA-2732 


Nov., 27, 
2000 


4ft 
4U 


— : 

Protein 


innnc7Q70c 770 70 a a m 
1000o3o/oo 2zo-Zo-4-0-r /- 

F 


RP 


PTA-2732 


Nov., 27, 
2000 


41 


UViA 


1AftAft/f7A7^ 1 /Cft Oil O A 

ioooy43y/5 ioo-zi3-z-o- 
A5-F 


CCCDT 

SSSP1 


PTA-2732 


XT— OT 

Nov., 27, 
2000 


47 
4Z 


Protein 


1AAAO/17Q7^ 1 <CA 717 O A 

lUUUy43y/5 100-Z13-Z-0- 

A5-F 


IS b Sri 


rlA-2732 


XT"— ^1*7 

Nov., 27, 
2000 


47 
4.3 


TYM A 


147441 1 AX A7^ 7_A 1 17 
1 4 /44 1 1 00-OZ 5-Z-O-C 1 1 -r 


/-VDT 1 






44 


Protein 


147441 106-025-2-0-C11-F 


CPI-1 






43 


TYNTA 
iJiNA 


1 7/1 C 1 ft 117 ftm 1 ft TTC T? 

1Z4o10_1 13-003-3-0-ri5-r 


Rbl-A- 
MODULIN 


PTA-2732 


Nov., 27, 
2000 


4£ 
hO 


Protein 


1 74 A 1 A 117 Aft 7 7 IJ^ 17 
1Z401U__1 1 3-003-3 -0-il5-r 


OCT A 

MODULIN 


tv~t A nio 

PTA-2732 


XT— n 

Nov., 27, • 
2000 


47 


tyma 


lAAAOCCI/rc 7AC QQ 1 A A^ 
lUUUoDDlOD ZUj-yy-l-0-Aj- 

F 


Tifapinix 


r 1A-2732 


\T.„ /*)*7 

Nov., 27, 
2000 


48 


r ruiciii 


1ftftft8<^1£^ Oft^ 00 1 _fk A^ 

F 


Tifapinix 


rlA-2732 


Nov., 27, 
2000 






^8RAQ8 1 84-1 1 _4 A VIA V 
d o o\jy o 1 o4- 1 1 -4-u-xl4-r 


A AT 

crypAAl 


rlA-2732 


XT.,, T"7 

Nov., 27, 
2000 




Protein 


SR8AQ8 1 84 1 1 _4 O XX A T7 
JooUyo_154-l l-4-0-H4-r 


CrypAAl 


F1A-2732 


Nov., 27, 
2000 


51 


DNA 


500721700 204-43-4-0-H10- 
F 


1 llapiIllA- 

A58S 






52 


Protein 


500721700 204-43-4-0-H10- 
F 


Tifapinix- 
A58S 






53 


DNA 


789749J82-14-3-0-C12-F 


Plasminute 


PTA-2732 


Nov., 27, 
2000 


54 


Protein 


789749J82-14-3-0-C12-F 


Plasminute 


PTA-2732 


Nov., 27, 
2000 
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Table I continued. 





DNA 


S 19757 184-4 9-0-F7 F 


PAT QT/TM 


pTA 97^9 


iNOV., Z /, 

2000 


56 




SI 0757 1 84-4-9 -0-F7JF 


PAT QIOXT 


PTA 97^9 
r l t\rZ 1 jZ 


"NT/Mr 97 
INOV., Z/, 

2000 


57 


DNA 


625004 1 88-1 5-4-0-Wfi-F 


vPHT 1£A1 


PTA 9^34 


Qat» 97 

oep., z /, 
2000 


58 


Protein 


625004_188-15^t-0-H6-F 


VCOL16A1 


PTA-2534 


Sep., 27, 
2000 


5Q 




A99'*5'} 14^ 11 T77 T7 




r 1 A-zo.54 


■Sep., Li, 
2000 


60 


Protein 


422353_145-ll-3-0-E7-F 


NK5 


PTA-2534 


Sep., 27, 
2000 


01 


TYWA 


jUU/IdoZI L 5- J -U-Co- 
F 


PLasminogen 

Carrier 

Protein 

/T>t 

(rLLP) 


PTA-2534 


Sep., 27, 
2000 


62 


Protein 


500715621 204-15-3-0-C6- 
F 


PLasminogen 
Carrier 
Protein 
(PLCP) 


PTA-2534 


Sep., 27, 
2000 


63 


DNA 


1 65 843_1 16-008-4-0-G4-F 


Novel 

Calpastatin 1 
(NCI) 


PTA-2534 


Sep., 27, 

2000 


64 


Protein 


1 65843_1 1 6-008^-0-G4-F 


Novel 

Caipastatin 1 
(NCI) 


PTA-2534 


Sep., 27, 
2000 


65 


DNA 


335752_157-1 5-4-0-B 1 1 -F 


Novel 

Lalpastatin 2 
(NC2) 






66 


Protein 


335752_157-15-4-0-Bll-F 


Novel 

Calpastaun 2 

(JNCz) 






en 


TYWA 


£A.fifS\l 1521 \ t \ OJ\ T?9 T? 
0400U / l ol-lD -Z-V-tLZ-r 


Benzodiazepi 
ne Receptor 
2 (BZRP-R2) 


r 1 A-zj J4 


Sep., 27, 
2000 


68 


Protein 


646607_181-15-2-0-E2-F 


Benzodiazepi 
ne Receptor 
2 (BZRP-R2) 


PTA-2534 


Sep., 27, 
ZUOU 


69 


DNA 


229654 114-049-1-0-F12-F 


LAP 






70 


Protein 


229654 114-049-1-0-F12-F 


LAP 






71 


TYWA 


aaoi 1/: 17/1_1 1 P1A T? 


snort 
Histone 
Deacetylase 
(SHDAC) 


JrlA-2534 


oep., 27, 
2000 


79 
/Z 


rTotein 




{Snort 
Histone 

(SHDAC) 


r 1 A-ZD34 


Cam 1*7 

Sep., 27, 
2000 


73 


DNA 


500716683 204-24-2-0-D12- 
F 


Protease- 
associated 
Paraplegin 
(PAP) 


PTA-2534 


Sep., 27, 
2000 


74 


Protein 


500716683 204-24-2-0-D12- 
F 


Protease- 
associated 


PTA-2534 


Sep., 27, 
2000 
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Paraplegin 
(PAP) 







Table I continued.. 



75 


DNA 


500760207_205-58-4-0-H6-F 


Ketothiolase 
(KT) 






76 


Protein 


500760207_205-58-4-0-H6-F 


Ketothiolase 
(KT) 






77 


DNA 


122421 105-076-4-0-H1-F 


BASI2 






78 


Protein 


122421 105-076-4-0-H1-F 


BASE 






79 


DNA 


99483J05-016-1-0-D7-F 


KSPI1 


PTA-2534 


Sep., 27, 
2000 


80 


Protein 


99483J05-016-1-0-D7-F 


KSPI1 


PTA-2534 


Sep., 27, 
2000 


81 


DNA 


517778_184-5-3-0-G3-F 


Amyloid 
Apoptotic 
Receptor 
(AAR) 






82 


Protein 


517778_184-5-3-0-G3-F 


Amyloid 
Apoptotic 
Receptor 
(AAR) 






83 


DNA 


100038_105-017-4-0-E4-F 


Soluble 
Activator of 
Wntl 
(SAW-1) 






84 


Protein 


100038_105-017-4-0-E4-F 


Soluble 
Activator of 
Wntl 
(SAW-1) 






. 85 


DNA ! 


100523_105-019-1-0-F3-F 


Soluble 
Activator of 
Wntl 
(SAW-1) 






86 


Protein 


100523_105-019-l-0-F3-F 


Soluble 
Activator of 
Wntl 
(SAW-1) 






87 


DNA 


1 16470_105-063-3-0-H7-F 


Dopamine 
AMPhetamin 
e INhibitor 
(Dampin) 






88 


Protein 


1 16470_105-063-3-0-H7-F 


Dopamine 
AMPhetamin 
e INhibitor 
(Dampin) 






89 


DNA 


122600_105-077-3-0-F9-F 


Dopamine 
AMPhetamin 
e INhibitor 
(Dampin) 


PTA-2732 


Nov., 27, 
2000 


90 


Protein 


122600_105-077-3-0-F9-F 


Dopamine 
AMPhetamin 
e INhibitor 
(Dampin) 


PTA-2732 


Nov, 27, 
2000 


91 


DNA 


651658 181-35-2-0-C8-F 


VAGS 
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92 


Protein 


651658 181-35-2-0-C8-F 


VAGS 






93 


DNA 


150011 110-006-3-0-D5-F 


TFPI-C16Pfs 






94 


Protein 


150011 110-006-3-0-D5-F 


TFPI-C16Pfs 






95 


DNA 


5 0073 746 l_205-43 -3-0-r3 -b 


TFPI- 
M162Qfs 






96 


^rotein 


500737461_205-43-3-0-E3-F 


TFPI- 
M162Qfs 






97 


T*WT A 

DNA 


1 00545_l 05-0 1 9-2-0-E3 -r 


Soluble 
Activator of 
Wnt2 
(SAW-2) 






98 


Protein 


1 00545_105-01 9-2-0-E3-F 


Soluble 
Activator of 
Wnt2 
l.SAW-2) 






99 


DNA 


479155_174^M-0-C8-F 


ADEVAR 


PTA-2732 


Nov., 27, 
2000 


100 


Protein 


479 1 55 Ji 74-4-4-0-C8-F 


ADEVAR 


PTA-2732 


Nov., 27, 
2000 


101 


DNA 


586587_l8l-9-2-0-C5-F 


ATP-binding 
cassette 1, 
lABC 






102 


Protein 


586587_l8l-9-2-0-C5-F 


ATP-binding 

cassette, 

hABC 






103 


DNA 


6203 1 5_1 88-1 3-1 -0-G12-F 


MOBP-81h 


PTA-2534 


Sep., 27, 
2000 


104 


Protein 


620315_188-13-l-0-G12-F 


MOBP-81h 


PTA-2534 


Sep., 27, 
2000 


105 


DNA 


646477_1 8 1-1 9-2-0-F4-F 


novel 

Apolipoprote 
inH 

(NAPOH) 






106 


Protein 


646477J81-19-2-0-F4-F 


novel 

Apolipoprote 
inH 

(NAPOH) 






107 


DNA 


1 13 165_105-056-3-0-G12-F 


human 
JNK3- 
binding 
protein 
(iLlJNivj-rjr J 






108 


Protein 


1 13165J05-056-3-0-G12-F 


human 
JNK3- 
binding 
protein 






109 


DNA 


231462 117-065-1-0-G11-F 


DROCK2 






110 


Protein 


231462 117-065-1-0-GU-F 


DROCK2 






111 


DNA 


500723589 205-34-3-0-G4-F 


Novel 17 
beta- 

hydroxystero 
id 

dehydrogena 
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112 


Protein 


500723589_205-34-3-0-G4-F 


Novel 17 
beta- 

hydroxystero 
id 

dehydrogena 
se type 2 ( 
NBHSD2) 
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Table H 



SEQID 


ORF 


Signal 

x cpilUc 


Mature 


Polyadenylation 
Signal 


PoIyA tail 


1 


n 1 R761 
[1433-15JOJ 






[iyo3-iy /uj 


TOAA1 OA1 £Ti 

[ZOOl-zOloJ 


•3 
J 


T70-Q171 
[3y-y 1 / J 


T70 1 1 61 

[3y-i 101 


T1 17 Q171 

[ii /-y 1 / j 


[1U43-1U3UJ 


HA££ 1AQ11 

[IO00-IU0IJ 


3 


l"8A-7 1 71 
[o**-3 1 / J 


[o4-14UJ 


T1A1 7171 
[141-31 /J 


T707_/IA91 

[3y /-4UzJ 


[423-438] 


7 

1 


n? 7481 


T79 011 
[3Z-y 11 


TQ9 7A81 


[yzo-y33j 


[y33-yooj 


Q 


r9 c A_«741 
[Z3*f-3 / 4J 


r9C/t 7QC1 
[Z34-Zy3J 


T906 ^7A1 

[zyo-3 /4J 






1 1 
1 1 


VISA <7A1 
[Z34-3 /4J 


r9^4 90^1 
[Z34-Zy3J 


T706 ^7A1 

[zyo-3 /4J 






17 

ID 


[Z3*fr-3 / 4 J 


r7^4_7Q^l 
|Z34-Zy3 J 


[zyo-3 /4J 






13 


T7^A £74.1 
[Z3 4-3 /4J 


T7C/1 7QC1 
[Z34-Zy3J 


T70/C ^7A1 

[Zyo-3 /4J 






1 7 
J / 


T777 1 A1 71 
[3Z/-1U13 J 






rt in 11 «a/n 
[1 131-1 136J 


n 1 £.f\ 1 1 TCI 

[1160-1175] 


1 Q 


n 17 C1 71 
[1 1Z-S13J 


n 1 9 1 £91 
[1 IZ-loZJ 


n csi 0 1 7i 
[103-ol3J 






Zl 


T1T7 1A9A1 

[1Z/-1UZ0J 


T1 9*7 1 

[lz/-lo3J 


Tl QA 1 AOA1 

[154-1U20J 






Z3 


[lU-lZlZJ 


r 1 a /zni 
[lU-oUJ 


[ol-lzlzj 


Tl *7AA 1 Tl /II 

[1709-1714] 


[1733-1746] 


9C 
ZD 


T1 99 C701 

[iz/-o/yj 


n 0*7 loci 
[lz/-iy&| 


T1 AA OTAl 

[iyy-5/yj 




[1224-1239] 


Z / 


[l lo-yoi j 






[1145-1150] 


[1164-1179] 


9G 

zy 


r7/1C 1 1 1 01 

[343-1 1 loj 


[343-404J 


MAC 111 Ol 

[4U3-llloJ 




n 1 ao 111 oi • 

[1103-1118] 


j i 


[14-lU4oJ 


r 1 >t on 
[14-yiJ 


TOT 1A/101 

[yZ-lU4oj 


[1234-1239] 


[1258-1273] 


77 
jj 


T77 £791 
[/3-O/ZJ 






[689-694] 


r*7AO Tin 

[708-723] 


7c 


n 1 o^ccci 
11 iy-o33j 






TOAA Of At 

[809-814] 


[830-845] 


j / 


T17 9*^01 

[i /-Z3yj 










7Q 

3y 


[ZOU-iU4oJ 


T9£A 7 1 Ql 

Izouoiyj 


T79A 1 A/101 

[3ZU-lU4oJ 


[1782-1787J 


n OAi 101 £\ 
[1801-1816] 


41 


[yi-4ozj 


roi 1 OAT 

[y 1-1 80] 


n 0 1 acw 
[181-462J 


[607-612] 


[628-643] 


A7 
43 


[Zzo-301 J 


T99 0 ao/ci 
[z2o-326J 


ri^n f A11 

1327-501 ] 








roc 07A1 
[yo-y34j 










AH 

4/ 


n/C7 1 1 7Q1 

[zo/-ii3yj 


OCA! 

[ZO7-330J 


pa c 1 110 AT 
[331-1139] 


[1246-1251] 


n nft 1 on/in 

[1279-1294] 


4y 


MO 1 1 AA1 

[4 o-l IvA/J 


mo 1 1 01 
(4o-l iyj 


F1 OA 1 1 AAT 

[izU-llUUJ 


[1 159-1 164J 


[1 179-1 194J 


CI 


TOQA 1 1 £91 

[zyu-i lozj 


T9QA 7*771 

[zyu-3 / J J 


[3 /4-llo2J 


no <rr a 1 n ai 

[1269-1274] 


[1302-1317] 


C7 


[1U44-1004J 






[1869-1874J 


[ 1 892-1 907 J 


<\C 
33 


P96 67 81 
[ZO-OZoJ 






[760-771] 


[795-809] 


<7 
3 / 


T/176 Q6A1 
[4 / o-y 04 J 






T1 1 A1 11 AiCl 

[1 101-1 lOoJ 


n 1 1 0 11 0*21 
[1118-1133] 


3y 


T7Q-6A91 






non oa/ii 


ro-j caoi 
[o23-83oJ 


61 
Ol 


n S 0-7641 


n^Q 9911 

[i3y-zzij 


T999 76A1 
[ZZZ- /04J 






D3 


T1 0 C -SR71 

JvO / J 


no* 76m 
[iy3-zouj 


T7£1 ^R71 
[Z0i-3o/J 


rc70 con 
[3 /o-3o3J 


[o04-oloj 


6* 


T 177-7671 


T177 7/191 
[ 1 / / -Z4Z J 


T7>17 7671 
[Z43-/0/J 


ro 1 a 0 1 oi 
[ol4-oiyj 


TQ9T 07/Cl 

[oZz-o3oj 


u / 


T63-5771 
- j / 1 






f7C A 7CC1 
[/3U-/33J 


[/ /4-/oyj 


60 


T67-94971 


T67 1 1A1 
LO/-1 14J 


riic 7/1771 
[1 13-Z4Z/J 


T9C99 9CV71 

[Z3ZZ-Z3Z /J 


[Z341-Z330J 


71 


[8-763] 


[8-58] 


T59-7631 


F1 S65-1 S671 




73 


[9-395] 


[9-56] 


[57-395] 




[864-879] 


75 


[88-1269] 






[1594-1599] 


[1619-1634] 


77 


[69-875] 


[69-131] 


[132-875] 


[1599-1604] 


[1627-1642] 


79 


[344-1144] 


[344-433] 


[434-1144] 






81 


[27-689] 


[27-122] 


[123-689] 


[1302-1307] 


[1325-1406] 


83 


[118-510] 


[118-189] 


[190-510] 


[1718-1723] 


[1739-1754] 
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85 


[118-510] 


[118-189] 


[190-510] 


[1718-1723] 


[1739-1754] 


87 


[152-655] 






[1399-1404] 


[1416-1431] 


89 


[152-655] 






[1399-1404] 


[1416-1431] 


91 


[48-1301] 


[48-119] 


[120-1301] 


[1360-1365] 


[1402-1417] 


93 


[278-733] 


[278-334] 


[335-733] 


[1072-1077] 


[1101-1115] 


95 


[253-744] 


[253-336] 


[337-744] 


[1269-1274] 


[1292-1307] 


97 


[118-504] 


[118-189] 


[190-504] 


[1819-1824] 


[1840-1855] 


99 


[95-613] 






[636-641] 


[652-667] 


101 


[154-639] 






[1023-1028] 


[1047-1062] 


103 


[150-392] 








[63-933] j 


105 


[35-1069] 


[35-91] 


[92-1069] 


[1146-1151] 


[1172-1187] 


107 


[16-1449] 






[1483-1488] 


[1505-1520] 


109 


[95-1252] 


[95-139] 


[140-1252] 


[1751-1756] 


[1774-1789] 


111 


[103-1263] 






[1341-1346] 


[1365-1408] 
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Table DI 



SEQID 
NO: 


Positions of immunogenic epitopes 


2 


21..28:34..42:56..65:80..85:95..105:128..I33 


4 


32..39:57..66:78..84:92..105:152..157:165..171:262..270:277..287 


6 


23..33:34..41:49..63 


8 


42..48:53..69:76..94:145..154:165..171:179..188:186..200:229..238 


10 


11..20:36..55:63..70:79..94 


12 


11..20:36..55:63..70:79..94 


14 


11..20:36..55:63..70:79..94 


16 


11..20:36..55:63..70:79..94 


18 


10..22:80..91:100..110:122..128:134..141:151..162:160..173:191..202:216..227 


20 


21..28:54..62:70..81:83..91:95..101:110..124:134..139:180..190 


22 


20..29:33..39:43..53:82..92:253..264 


24 


16..27:87..97:152..159:169..175:178..188:213..221:273..282:308..313:339..347:385..395 


26 


45..55:52..63:106..117:118..128:126..131:148..155:157..164:172..190:212..221:232..247 


28 


44..53:55..65:82..90:93..114:119..132:148..163:174..179:176..181:199..219:218..228:242 
.. 253:272..278 


30 


1..6:41..46:92..102:133..139:143..163:161..181:185..195:214..221 


32 


53..77:120..130:144..159:159..169:196..202:266..272:331..344 


34 


147..157:189..199 


36 


113..125:139..151:149..160 


38 


1..8:49..63:66..76 


40 


27..35:106..111:183..194:222..228:241..247:255..262 


42 


38..49:49..54:71..82:92..116 


44 


1..19 


46 


1..8:9..14:70..80:85..92:110..116:145..158:202..216:231..246:244..253:262..276 


48 


57..63:85..96:104..111:114..121:127..142:159..169:169..178:185..191:206..214:213..222: 
228..250 


50 


58..67:116..125:149..154:188..193:213..218:233..241:332..339 


52 


56..63:85..96:104..111:114..121:127..142:159..169:169..178:185..191:206..214:213..222: 
228..250 


54 


21..30:124..137:147..159:181..189 


56 


55..64:80..86:l67..l74 


58 


3..l5:l2..42:40..66:75..85:90..l07:l23..l42:l47..l59 


60 


30..39:73..89:96..l02:l63..l87 


62 


20..3l:89..l0l:l06..ll6:157..l72:l80..l94 


64 


28..34:37..45:49..6l:6l..77:l02..l08 


66 


27..35:37..45:49..6l:6l..77:l02..l09:l44..l52:l70..l80:l79..l88 


68 


22..36:l5l..l56:l6l..l69 
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70 


19 24-34 45-79 94:100..107:146..152:161..168:174..179:199 204-238 246 259 269* 
329..342:380..393:39O..395:393..398:395..400:397..404:408..414:427..434:447..456:461 
..474:481..489:492..499:506..513:520..540:556..563:561..568:584..590:596..604:626.. 
632:629..634:634..656:654..659:675..681:714..731:730..743:745..766:768..786 


72 


97..1 10:234.-245 


74 


10..23:27..32:33..44:103..108:111..122 | 


76 


1 16..122: 182.. 188:205..215:223..23 1 :234..241 :35 1 ..356:364..374 


78 


67..73:71..85:142..148:176..195:229..237:236..246:248..268 


80 


25..44:54..61:93..99:99..108:107..123:129..144:164..172:176..185:203..210:214..221:225 
..233:243..253 


82 


42..48:84..93:104..118:122..132:141..147:153..161 


84 


42..51:76..94:97..126 


86 


42..51:76..94:97..126 


88 


6..14:13..23:25..39:36..42:59..67:79..86:110..120:123..132:133..145:153..167 


90 


6..14:13..23:25..39:36..42:59..67:79..86:110..120:123..132:133..145:153..167 


92 


25..33:31..48:65..73:125..134:183..192:216..221:255..260:280..285:300..308:400..405 


94 


48..54:76..87:95..102:107..115:118..125:131..141 


96 


57..63:85..96:104..111:116..124:127..134:140..155 


98 


42..51:76..94:116..123 


100 


26..33:46..54:104..1 17:125..130 


102 


15 23-44 55-52 62-77 83-83 88-115 124-132 148-145 156 


104 


2..23:34..39:41..46:50..60:67..80 


106 


21..30:40..50:49..62:99..106:123..133:156..169:189..198:197..205:203..216:224..232:232 
..246:300.31 5:336..344 


108 


9..20:33..52:68..75:91..97:123..130:175..189:186..193:195..204:216..227:229..234:246.. 
252:249..254:302..320:386..396:402..412:409..415:429..451 


110 


8..17:70..78:111..123:142..155:176..191:189..194:191..198:206..220:235..240:250..262: 
285..291:331..340:346..355 


112. 


25„35:115..131:207..214:230..235:272..278:291..298:313..318:336..345:362..374:377.. 
386 
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Table IV 



SEQ ED 
NO: 


Preferentially excluded fragments 


Preferentially included fragments 


1 


[l-507];[1524-2004] 


[508-1523];[2005-2016] 


3 


[l-477];[507-849];[85 1-1081] 


[478-506];[850-850] 


5 


[1-430] 


[431-438] 


7 


[1-816] 


[817-968] ; 


9 


[l-190];[205-336];[338-527] 


[191-204];[337-337];[528-730] 


11 


[l-190];[205-336];[338-527] 


[191-204];[337-337];[528-733] 


13 


[l-190];[205-336];[338-527] 


[191-204];[337-337];[528-732] 


15 


[l-190];[205-336];[338-527] 


[191-204];[337-337];[528-733] 


17 


[31-415];[417-476] 


[1-30];[416-416];[477-1175] 


19 


[l-239];[241-593];[673-732] 


[240-240];[594-672];[733-844] 


21 


[1-533];[1323-1455];[1459-1751] 


[534-1322];[1456-1458];[1752-1997] 


23 


[l-289];[291-320] 


[290-290];[321-1746] 


25 


[1-528] | 


[529-1239] 


27 


[1-417];[814-1162] 


[418-813];[1163-1179] 


29 


[l-172];[178-334] 


[173-177];[335-1118] 


31 


[l-122];[385-435] 


[123-384];[436-1816] 


33 


[1-585] 


[586-643] 


35 


[l-436];[44^487] 


[437-443] ; [488-501] 


37 


[l-71];[73-466] 


[72-72];[467-845] 


39 


[1-500] 


[501-517] 


41 


[l-575];[683-1045]; 
. [1047-1 141];[1 149-1 178] 


[576-682];[1046-1046]; 
[1 142-1 148];[1 179-1 194] 


43 


[1-558] 


[559-960] 


45 


[l-510];[533-572] 


[511-532];[573-1294] 


47 


[l-519];[523-552] 


[520-522];[553-1273] 


49 


[1-723] 


None 


51 


[l-533];[556-595] 


[534-555];[596-1317] 


53 


[l-64];[67-441];[1035-1306]; 
[1406-1488];[1514-1711]; 
[1713-1787];[1789-1892] 


[65-66];[442-1034];[1307-1405]; 
[1489-1513];[1712-1712]; 
[1788-1788];[1893-1907] 


55 


[1-483] 


[484-809] 


57 


[1-494] 


[495-1133] 


59 


s [2-523] 


[l-l];[524-838] 


61 


[1-427] 


[428-862] 


63 


[l-30];[125-299];[301-570] 


[31-124];[300-300];[571-618] 


65 


[14-105] 


[l-13];[106-836] 


67 


[l-293];[304-54l] 


[294-303];[542-789] 


69 


[l-466];[900-974] 


[467-899];[975-2556] 


71 


| [l-486];[526-560];[987-1588] 


[487-525];[561-986];[1589-1603] 


73 


j [l-435];[486-517];[599-708]; 
[728-803];[812-879] 


[436-485];[518-598];[709-727]; 
[804-811] 



388 



WO 02/094864 

Table IV continued.. 



PCT/IB01/01715 



75 


[1-465] 


[466-1634] 


77 


[2-394];[396-564];[681-832]; 
[1207-1294] 


[l-l];[395-395];[565-680]; 
[833-1206];[1295-1642] 


79 


[l-218];[220-591];[605-663] 


[219-219];[592-604];[664-1466] 


81 


[1-432] 


[433-1406] 


83 


[1-339] 


[340-1754] 


85 


[1-339] 


[340-1754] 


87 


[1-433];[1261-1355] 


[434-1260];[1356-1431] 


89 


[1-433];[1261-1355] 


[434-1260];[1356-1431] 


91 


[l-738];[884-1342];[1350-1380] 


[739-883];[1343-1349];[1381-1417] | 


93 


[1-494];[517-581] 


[495-516];[582-1115] 


95 


[l-189];[191-496];[519-583] 


[190-190];[497-518];[584-1307] 


97 


[1-339] 


[340-1855] 


99 


[l-405];[426-457] 


[406-425];[458-667] 


101 


[l-44];[666-753];[783-813]; 
[899-965];[981-1013] 


[45-665];[754-782];[814-898]; 
[966-980];[1014-1062] I 


103 


[l-77];[79-412];[418-456];[758-916] 


[78-78];[413-417];[457-757];[917-933] 


105 


[l-287];[289-635] 


[288-288];[636-1187] 


107 


[1-501];[680-719];[721-816]; 
[822-853]; [982-1 180]; 
[1182-1235];[1237-1383]; 
[1404-1520] 


[502-679],[720-720];[8 17-821]; 
[854-981];[l 181-1 181]; 
[1236-1236];[1384-1403] 


109 


[l-393];[409-503] 


[394-408];[504-1789] 


111 


[l-777];[779-860];[1365rl408] 


[778-778];[861-1364] 



389 
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WHAT IS CLAIMED IS: 

1 1. An isolated polynucleotide, comprising a nucleic acid sequence selected from the group 



2 consisting of: 

3 a) a polynucleotide of an even SEQ ID NO., or of a human cDNA of a deposited clone, 

4 encoding at least any single integer from 6 to 500 amino acids of any one odd SEQ ID 

5 NO, 

6 b) a polynucleotide of an even SEQ ID NO, or of a human cDNA of a deposited clone, 

7 encoding the signal peptide sequence of any one odd SEQ ID NO, 

8 c) a polynucleotide of an even SEQ ID NO, or of a human cDNA of a deposited clone, 

9 encoding a mature polypeptide sequence of any one odd SEQ ID NO, 

10 d) a polynucleotide of an even SEQ ID NO, or of a human cDNA of a deposited clone, 

1 1 encoding a full length polypeptide sequence of any one odd SEQ ID NO, 

12 e) a polynucleotide of an even SEQ ID NO, or of a human cDNA of a deposited clone, 

13 encoding a polypeptide sequence of a biologically active fragment of any one odd 

14 SEQ ID NO, 

15 f) a polynucleotide encoding a polypeptide sequence of at least any single integer from 6 

16 to 500 amino acids of any one odd SEQ ID NO. or of a polypeptide encoded by a 

1 7 human cDNA of a deposited clone, 

18 g) a polynucleotide encoding a polypeptide sequence of a signal peptide of any one odd 

19 SEQ ID NO. or of a signal peptide encoded by a human cDNA of a deposited clone, 

20 h) a polynucleotide encoding a polypeptide sequence of a mature polypeptide of any one 

21 odd SEQ ED NO. or of a mature polypeptide encoded by a human cDNA of a 

22 deposited clone, 

23 i) a polynucleotide encoding a polypeptide sequence of a full length polypeptide of any 

24 one odd SEQ ID NO. or of a mature polypeptide encoded by a human cDNA of a 

25 deposited clone, 

26 j) a polynucleotide encoding a polypeptide sequence of a biologically polypeptide of 

27 any one odd SEQ ID NO, or of a biologically polypeptide encoded by a human 

28 cDNA of a deposited clone, 

29 k) a polynucleotide of any one of a) through j) further comprising an expression vector, . 

30 1) a host cell recombinant for a polynucleotide of a) through k) above, 

31 m) a non-human transgenic animal comprising the host cell of k), 

32 n) a polynucleotide of a) through j) further comprising a physiologically acceptable 

33 carrier. 



1 2. A polypeptide comprising an amino acid sequence selected from the group consisting of: 
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2 a) any single integer from 6 to 500 amino acids of any one odd SEQ ED NO. or of a 

3 polypeptide encoded by a human cDNA of a deposited clone; 

4 b) a signal peptide sequence of any one odd SEQ ID NO. or encoded by a human cDNA 

5 of a deposited clone; 

6 c) a mature polypeptide sequence of any one odd SEQ ID NO. or encoded by a human 

7 cDNA of a deposited clone; 

8 d) a full length polypeptide sequence of any one odd SEQ ID NO. or encoded by a 

9 human cDNA of a deposited clone; 

10 e) a polypeptide of a) through d) further comprising a physiologically acceptable earner. 

1 3 , A method of making a polypeptide, said method comprising 

2 a) providing a population of host cells comprising the polynucleotide of claim 1 ; 

3 b) culturing said population of host cells under conditions conducive to the production of 

4 a polypeptide of claim 2 within said host cells; and 

5 c) purifying said polypeptide from said population of host cells. 

1 4. A method of making a polypeptide, said method comprising: 

2 a) providing a population of cells comprising a polynucleotide encoding the polypeptide 

3 of claim 2, operably linked to a promoter; 

4 b) culturing said population of cells under conditions conducive to the production of said 

5 polypeptide within said cells; and 

6 c) purifying said polypeptide from said population of cells. 

1 5 . An antibody that specifically binds to the polypeptide of claim 2. 

1 6* A method of binding a polypeptide of claim 2 to an antibody of claim 5, comprising contacting 

2 said antibody with said polypeptide under conditions in which antibody can specifically bind to 

3 said polypeptide. 

1 7. A method of determining whether a GENSET gene is expressed within a mammal, said method 

2 comprising the steps of: 

3 a) providing a biological sample from said mammal 

4 b) contacting said biological sample with either of: 

5 i) a polynucleotide that hybridizes under stringent conditions to the 

6 polynucleotide of claim 1 ; or 

7 ii) a polypeptide that specifically binds to the polypeptide of claim 2; and 

8 c) detecting the presence or absence of hybridization between said polynucleotide 
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9 and an RNA species within said sample, or the presence or absence of binding 

10 of said polypeptide to a protein within said sample; 

1 1 wherein a detection of said hybridization or of said binding indicates that said GENSET gene is 

12 expressed within said mammal. 

1 8. The method of claim 7, wherein said polynucleotide is a primer, and wherein said hybridization 

2 is detected by detecting the presence of an amplification product comprising the sequence of 

3 said primer. 

1 9. The method of claim 7, wherein said polypeptide is an antibody. 

1 10. A method of determining whether a mammal has an elevated or reduced level of GENSET gene 

2 expression, said method comprising the steps of: 

3 a) providing a biological sample from said mammal; and 

4 b) comparing the amount of the polypeptide of claim 2, or of an KNA species 

5 encoding said polypeptide, within said biological sample with a level 

6 detected in or expected from a control sample; 

7 wherein an increased amount of said polypeptide or said RNA species within said biological 

8 sample compared to said level detected in or expected from said control sample indicates that 

9 said mammal has an elevated level of said GENSET gene expression, and wherein a decreased 

10 amount of said polypeptide or said RNA species within said biological sample compared to said 

1 1 level detected in or expected from said control sample indicates that said mammal has a reduced 

12 level of said GENSET gene expression. 

1 1 1 . A method of identifying a candidate modulator of a GENSET polypeptide, said method 

2 comprising: 

3 a) contacting the polypeptide of claim 2 with a test compound; and 

4 b) determining whether said compound specifically binds to said polypeptide; 

5 wherein a detection that said compound specifically binds to said polypeptide indicates that said 

6 compound is a candidate modulator of said GENSET polypeptide. 

1 12. The method of claim 11, further comprising testing the biological activity of said GENSET 

2 polypeptide in the presence of said candidate modulator, wherein an alteration in the biological 

3 activity of said GENSET polypeptide in the presence of said compound in comparison to the 

4 activity in the absence of said compound indicates that the compound is a modulator of said 

5 GENSET polypeptide. 
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113. A method for the production of a pharmaceutical composition comprising 

2 a) identifying a modulator of a GENSET polypeptide using the method of claim 1 1 ; 

3 and 

4 b) combining said modulator with a physiologically acceptable carrier. 

5 
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SEQUENCE LISTING 

<110> GENSET 

<120> HUMAN CDNAS AND PROTEINS AND USES THEREOF 

<130> 91.W01 

<150> US 60/305,456 
<151> 2001-07-13 

<150> US 60/302,277 
<151> 2001-06-29 

<150> US 60/298,698 
<151> 2001-06-15 

<150> US 60/293,574 
<151> 2001-05-25 

<150> US 60/224,009 
<151> 2000-08-07 

<160> 112 

<170> JPatent 

<210> 1 

<211> 2016 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> 5'UTR 
<222> 1. .1434 

<220> 

<221> CDS 

<222> 1435. .1836 

<220> 

<221> 3'UTR 
<222> 1837. .2016 

<220> 

<221> polyA_signal 
<222> 1965. .1970 

<220> 

<221> polyA_site 
<222> 2001. .2016 

<400> 1 

aaggtctctc tgcatgcata caccaaggaa aagccacatg aggacataac caggaagaga 60 

gccatcacca agaacccgaa catgcggaca ccctgatctc ggacttctag ccttcagaac 120 

cgttgccaca gttttgatga tcatctctct cccaaccaag atggtggaaa aagcaaaaac 180 

gtggtgaatc ttggagcaat ccgacaaggc atgaaacgct tccaatttct gttaaactgc 240 

tgtgagccag ggacaattcc tgatgcctcc atcctagcag ctgccttgga tctactatgc 300 

ggcattcttc tgattcattt ttctccattt gtgctgtttt tctctgtgat gtgaatccat 360 

ccctatccat tatgtcatgc ctccatcttt tgctgcttct tcagattgca ctgagccata 420 

agaggaagcc cctgtggtgg ccagagcagc cttgttcctg gaatgtgctc gttttgttca 480 

ccgctgcaac cgtggcaact ggccagagtg gatgaaaggg caccacgtga acatcaccaa 540 

gaaaggactt tcccggggac gctctcccat tgtgggcaac aagcgaaacc agaagctgca 600 

gtggaatgca gccaagctct tctaccaatg gggagacaag gaaaaaaggt gaagaataaa 660 
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attttagaca gagctgaaca taaacacaca 72 0 
aaagattact tggaataact gttacaattt 7 80 
aatgtgctta ccaactaagg caattggcgt 840 
tgagagccca gccaacctgc tgggtctcat 900 
aaaggaggat gaggaggaag actttttaga 960 
tcatcttgca tttaaaagct gattatggtg 1020 
aagcttgtct tttccattct tgatgagagg 1080 
gagaaaatgg ttttcctgaa aaaaacgata 1140 
ccacctattt tcaaatgaaa tcgtgaaaaa 1200 
ctgccctcaa aacagcaaga cagacatccc 1260 
ctccaaatct agttcactgc catatacata 1320 
atcttgcgaa attacttccc atttctgttt 1380 
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85 90 95 

Asp Gin Cys Arg Asp Gly Lys Val Gly Phe Gin Ser Phe Phe Ser Leu 

100 105 110 

He Ala Gly Leu Thr lie Ala Cys Asn Asp Tyr Phe Val Val His Met 

115 120 125 
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130 

<210> 3 

<211> 1081 

<212> DNA 
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<222> 1. .38 
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<221> CDS 
<222> 39. .917 

<220> 

<221> 3»UTR 
<222> 918.. 1081 

<220> 

<221> polyA_signal 
<222> 1045. .1050 

<220> 

<221> polyA_site 
<222> 1066. .1081 

<400> 3 
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Tyr 


He 


Phe 


Ser Glu Ser Tyr Gly Gly Lys 


Met Ala Ala 


Gly He 
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135 










140 










145 








150 


oJLy 


Leu 


Glu 


Leu 


Tyr 

1 cc 

JLSj 


Lys 


Ala 


lie 


Gin 


Arg 
lou 


Gly Tnr 


ne 


Lys 


Cys 
165 


Asn 


rue 


Ala 


Gly 


XT-* 1 

val 

170 


Ala 


lieu 


Giy 


Asp 


Ser 
175 


Trp 


He Ser 


Pro 


val 
180 


Asp 


Ser 


vax 


Leu 


ser 
185 


Trp 


Giy 


Pro 


Tyr 


Leu 
190 


Tyr 


ser 


Met Ser 


T All 

Leu 
195 


Leu 


Glu 


Asp 


Lys 


Gly 


Leu 


Ala 


GlU 


val 


Ser 

O AC 


Lys 


Val 


Ala 


Glu Gin 

ALU 


Val 


Leu 


TV 

Asn 


Ala 


Val 


Asn 


Lys 


Gly 


Leu 


Tyr 


Arg 


Glu 


Ala 


Thr 


Glu Leu 


Trp 


Gly 


Lys 


Ala 


215 










220 










225 








230 


Glu 


Met 


lie 


He 


Glu 

235 


Gin 


Val 


Lys 


Arg 


Gly 
240 


Asn Thr 


Gin 


Arg 


Leu 
245 


Ala 


Cys 


Leu 


Ala 


Phe 
250 


Ser 


Gly 


Gly 


Tyr 


Arg 
255 


Ala 


His Gly 


Trp 


Cys 
260 


Cys 


Gin 


Thr 


Trp 


Ser 


Leu 


His 























265 



<210> 5 
<211> 438 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> 5'UTR 
<222> 1. .83 

<220> 
<221> CDS 
<222> 84. .317 

<220> 

<221> 3 'UTR 
<222> 318. .438 

<220> 

<221> polyA_signal 
<222> 397. .402 

<220> 

<221> polyA_site 
<222> 423. .438 

<400> 5 

atagaaaagg acatctcttg agacttcact tcagcttcac tgacttcttg actctcctct 60 
tgagtaaaag gactcagcca act atg aag ttt ttt gtc ttt get tta gtc ttg 113 

Met Lys Phe Phe Val Phe Ala Leu Val Leu 
-15 -10 



get 


etc 


atg 


att 


tec 


atg 


att 


age 


get 


gat 


tea 


cat 


gaa 


aag 


aga 


cat 


161 


Ala 


Leu 


Met 


He 
-5 


Ser 


Met 


He 


Ser 


Ala 
1 


Asp 


Ser 


His 


Glu 


Lys 


Arg 


His 




cat 


ggg 


tat 


aga 


aga 


aaa 


ttc 


cat 


gaa 


aag 


cat 


cat 


5 

tea 


tac 


cat 


ate 


209 


His 


Gly 


Tyr 


Arg 


Arg 


Lys 


Phe 


His 


Glu 


Lys 


His 


His 


Ser 


Tyr 


His 


lie 






10 










15 










20 












aca 


eta 


eta 


cca 


ctt 


ttt 


gaa 


gaa 


tea 


tea 


aag 


age 


aat 


gca 


aat 


gaa 


257 


Thr 


Leu 


Leu 


Pro 


Leu 


Phe 


Glu 


Glu 


Ser 


Ser 


Lys 


Ser 


Asn 


Ala 


Asn 


Glu 




25 










30 










35 










40 




aaa 


cac 


tat 


aat 


tta 


ctg 


tat 


act 


ctt 


tgt 


ttc 


agg 


ata 


ctt 


gee 


ttt 


305 


Lys 


His 


Tyr 


Asn 


Leu 


Leu 


Tyr 


Thr 


Leu 


Cys 


Phe 


Arg 


He 


Leu 


Ala 


Phe 





45- 50 55 



tea att gtc act tgatgatata attgeaattt aaactgttaa gctgtgttca 357 
Ser He Val Thr 
60 
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gtactgtttc tgaataatag aaatcacttc tctaaaagca ataaatttca agcacatttt 417 
taaataaaaa aaaaaaaaaa a 438 

<210> 6 
<211> 78 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> 1. .19 

<400> 6 

Met Lys Phe Phe Val Phe Ala Leu Val Leu Ala Leu Met He Ser Met 

-15 -10 -5 

He Ser Ala Asp Ser His Glu Lys Arg His His Gly Tyr Arg Arg Lys 

1 5 10 

Phe His Glu Lys His His Ser Tyr His He Thr Leu Leu Pro Leu Phe 

15 20 25 

Glu Glu Ser Ser Lys Ser Asn Ala Asn Glu Lys His Tyr Asn Leu Leu 
30 35 40 45 

Tyr Thr Leu Cys Phe Arg He Leu Ala Phe Ser He Val Thr 
50 55 

<210> 7 

<211> 968 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> 5'UTR 
<222> 1. .31 

<220> 
<221> CDS 
<222> 32. .748 

<220> 

<221> 3'UTR 
<222> 749. .968 

<220> 

<22l> polyA__signal 
<222> 928. .933 

<220> 

<221> polyA_site 
<222> 953. .968 

<400> 7 

tgatcaggac tcctcagttc accttctcac a atg agg etc cct get cag etc 52 

Met Arg Leu Pro Ala Gin Leu 
-15 

ctg ggg ctg eta atg etc tgg gtc tct gga tec agt ggg gat att gtg 100 
Leu Gly Leu Leu Met Leu Trp Val Ser Gly Ser Ser Gly Asp He Val 

-10 -5 1 

atg act cag tct cca etc ttc ctg ccc gtc acc cct gga gag ccg gec 148 
Met Thr Gin Ser Pro Leu Phe Leu Pro Val Thr Pro Gly Glu Pro Ala 
5 10 15 20 

tec ate tec tgc agg tct agt cag age etc ctg cat gtt caa ggg tec 196 
Ser He Ser Cys Arg Ser Ser Gin Ser Leu Leu His Val Gin Gly Ser 

25 30 35 

aac tat ttg gat tgg tac cac cag aag cca ggg cag tct cca caa etc 244 
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Asn Tyr 


Leu 


Asp 


Trp 


Tyr 


His 


Gin 


Lys 


Pro 


Gly 


Gin 


Ser 


Pro 


Gin 


Leu 








40 










45 










50 








ctg ata 


tac 


ttg 


ggt 


tct 


aat 


egg 


gec 


tec 


ggg 


gtc 


cct 


gac 


agg 


ttc 


292 


Leu lie 


Tyr 


Leu 


Gly 


Ser 


Asn 


Arg 


Ala 


Ser 


Gly 


Val 


Pro 


Asp 


Arg 


Phe 






55 










60 










65 










agt ggc 


agt 


gga 


tea 


ggc 


aca 


gat 


ttc 


aca 


ctg 


aaa 


ate 


agt 


aga 


gtg 


340 


Ser Gly 


Ser 


Gly 


Ser 


Gly 


Thr 


Asp 


Phe 


Thr 


Leu 


Lys 


lie 


Ser 


Arg 


Val 




70 










75 










80 












gag get 


gag 


gat 


gtt 


ggg 


gtt 


tat 


tac 


tgc 


atg 


caa 


get 


eta 


caa 


act 


388 


Glu Ala 


Glu 


Asp 


Val 


Gly 


Val 


Tyr 


Tyr 


Cys 


Met 


Gin 


Ala 


Leu 


Gin 


Thr 




85 








90 










95 










100 




cca ttc 


act 


ttc 


ggc 


cct 


ggg 


acc 


aga 


gtg 


gat 


ate 


aag 


cga 


act 


gtg 


436 


Pro Phe 


Thr 


Phe 


Gly 


Pro 


Gly 


Thr 


Arg 


Val 


Asp 


lie 


Lys 


Arg 


Thr 


Val 










105 










110 










115 






get gca 


cca 


tct 


gtc 


ttc 


ate 


ttc 


ccg 


cca 


tct 


gat 


gag 


cag 


ttg 


aaa 


484 


Ala Ala 


Pro 


Ser 


val 


Phe 


lie 


Phe 


Pro 


Pro 


Ser 


Asp 


Glu 


Gin 


Leu 


Lys 








120 










125 










130 








tct gga 


act 


gec 


tct 


gtt 


gtg 


tgc 


ctg 


ctg 


aat 


aac 


ttc 


tat 


ccc 


aga 


532 


Ser Gly 


Thr 


Ala 


Ser 


Val 


Val 


Cys 


Leu 


Leu 


Asn 


Asn 


Phe 


Tyr 


Pro 


Arg 






135 










140 










145 










gag gec 


aaa 


gta 


bag 


tgg 


aag 


gtg 


gat 


aac 


gec 


etc 


caa 


teg 


ggt 


aac 


580 


Glu Ala 


Lys 


Val 


Gin 


Trp 


Lys 


Val 


Asp 


Asn 


Ala 


Leu 


Gin 


Ser 


Gly 


Asn 




150 










155 










160 












tec cag 


gag 


agt 


gtc 


aca 


gag 


cag 


gac 


age 


aag 


gac 


age 


acc 


tac 


age 


628 


Ser Gin 


Glu 


Ser 


Val 


Thr 


Glu 


Gin 


Asp 


Ser 


Lys 


Asp 


Ser 


Thr 


Tyr 


Ser 




165 








170 










175 










180 




etc age 


age 


ace 


ctg 


acg 


ctg 


age 


aaa 


gca 


gac 


tac 


gag 


aaa 


cac 


aaa 


676 


Leu Ser 


Ser 


Thr 


Leu 


Thr 


Leu 


Ser 


Lys 


Ala 


Asp 


Tyr 


Glu 


Lys 


His 


Lys 










185 










190 










195 






gtc tac 


gee 


tgc 


gaa 


gtc 


acc 


cat 


cag 


ggc 


ctg 


age 


teg 


ccc 


gtc 


aca 


724 


Val Tyr 


Ala 


Cys 


Glu 


Val 


Thr 


His 


Gin 


Gly 


Leu 


Ser 


Ser 


Pro 


Val 


Thr 








200 










205 










210 








aag age 


ttc 


aac 


agg 


gga 


gag 


tgt 


tagagggaga agtgccccca cctgctcctc 


778 


Lys Ser 


Phe 


Asn 


Arg 


Gly 


Glu 


Cys 





















215 220 



agttccagcc tgaccccctc ccatcctttg gcctctgacc ctttttccac aggggaccta 838 
cccctattgc ggtcctccag ctcatctttc acctcacccc cctcctcctc cttggcttta 898 
attatgetaa tgttggagga gaatgaataa ataaagtgaa tctttgcacc tgttaaaaaa 958 
aaaaaaaaaa 968 

• 

<210> 8 
<211> 239 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> 1. .20 

<400> 8 

Met Arg Leu Pro Ala Gin Leu Leu Gly Leu Leu Met Leu Trp Val Ser 
-20 -15 -10 -5 

Gly Ser Ser Gly Asp lie Val Met Thr Gin Ser Pro Leu Phe Leu Pro 

1 5 10 

Val Thr Pro Gly Glu Pro Ala Ser He Ser Cys Arg Ser Ser Gin Ser 

15 20 25 

Leu Leu His Val Gin Gly Ser Asn Tyr Leu Asp Trp Tyr His Gin Lys 

30 35 ~ 40 

Pro Gly Gin Ser Pro Gin Leu Leu He Tyr Leu Gly Ser Asn Arg Ala 
45 50 55 60 

Ser Gly Val Pro Asp Arg Phe Ser Gly Ser Gly Ser Gly Thr Asp Phe 

65 70 75 

Thr Leu Lys He Ser Arg Val Glu Ala Glu Asp Val Gly Val Tyr Tyr 
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80 85 90 



Cys 


Fieu 


uin a .La. jjtsu. 




inr 


Pro 


ipne 


mr 


vne 


oiy 


Pro 


vi±y 


Thr 


Arg 












1 ftA 

lUu 










lUb 






Val 

Veil 


Aon 


lie jjyB nig 


-LIU 


v ax 


Til a 

ilia 


Jt\±ci 


Pro 


Car 

□ CI 


Val 


rUc 


lie 


pne 


Pro 




inn 

1JLU 






lib 










i on 












OCi 


7V nn flTn CX~\ n 

iitip uXU will 


uou 


T ,vc 
uy a 






T>it- 
JLill. 




Cav 
OCI 


Val 


Val 


Cys 


Leu 


X*sD 






J. J u 










1 "3 C 










140 


Leu 


Asn 


Asn Phe Tyr 


Pro 


Ara 


Glu 


Ala 


Lys 


Val 


Gin 




Lys 


Val 


7\ cn 






145 










150 










155 




As n 


Ala 


Leu Gin Ser 


Gly 


Asn 


Ser 


Gin 


Glu 


Ser 


Val 


Thr 


Glu 


Gin 


Asp 






160 








165 










170 




Ser 


Lys 


Asp Ser Thr 


Tyr 


Ser 


Leu 


Ser 


Ser 


Thr 


Leu 


Thr 


Leu 


Ser 


Lys 






175 






180 










185 








Ala 


Asp 


Tyr Glu Lys 


His 


Lys 


Val 


Tyr 


Ala 


Cys 


Glu 


Val 


Thr 


His 


Gin 




190 






195 










200 










Gly 


Leu 


Ser Ser Pro 


Val 


Thr 


Lys 


Ser 


Phe 


Asn 


Arg 


Gly 


Glu 


Cys 





205 210 215 

<210> 9 
<211> 730 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> 5'UTR 
<222> 1. .253 

<220> 
<221> CDS 
<222> 254. .574 

<220> 

<221> 3'UTR 
<222> 575. .730 

<400> 9 

agatgagtgt tcagctctca gcagagaggt tagctcctct ctgcagcttg tcctgttgtc 60 

tcctcaagtc tggctgagtc cggagttttt atgagcct ca gaggggagga agtgcatgct 120 

gattaatcca tgggcaggcc tggaaaagtt cccactccag tctgcgggac ccacagcctg 180 

gccctcaggc ctcaggcctt cccaggcttg aagattgggc ttcacctggg acctacccct 240 

tctgcctagg age atg tct gec tec tgc tgc ctt tea tgg tgc cca gec 289 
Met Ser Ala Ser Cys Cys Leu Ser Trp Cys Pro Ala 

















-10 










-5 










aag 


get 


aag 


teg 


aaa 


tgt 


ggc 


cca 


acc 


ttc 


ttc 


ccc 


tgt 


gee 


age 


ggc 


337 


Lys 


Ala 
1 


Lys 


Ser 


Lys 


Cys 
5 


Gly 


Pro 


Thr 


Phe 


Phe 
10 


Pro 


Cys 


Ala 


Ser 


Gly 
15 




ate 


cat 


tgc 


ate 


att 


ggt 


cgc 


ttc 


egg 


tgc 


aat 


ggg 


ttt 


gag 


gac 


tgt 


385 


He 


His 


Cys 


He 


He 
20 


Gly 


Arg 


Phe 


Arg 


Cys 
25 


Asn 


Gly 


Phe 


Glu 


Asp 
30 


Cys 




ccc 


gat 


ggc 


age 


gat 


gaa 


gag 


aac 


tgc 


aca 


gca 


aac 


cct 


ctg 


ctt 


tgc 


433 


Pro 


Asp 


Gly 


Ser 
35 


Asp 


Glu 


Glu 


Asn 


Cys 
40 


Thr 


Ala 


Asn 


Pro 


Leu 
45 


Leu 


Cys 




tec 


acc 


gec 


cgc 


tac 


cac 


tgc 


aag 


aac 


ggc 


etc 


tgt 


att 


gac 


aag 


age 


481 


Ser 


Thr 


Ala 
50 


Arg 


Tyr 


His 


Cys 


Lys 
55 


Asn 


Gly 


Leu 


Cys 


He 
60 


Asp 


Lys 


Ser 




ttc 


ate 


tgc 


gat 


gga 


cag 


aat 


aac 


tgt 


caa 


gac 


aac 


agt 


gat 


gag 


gaa 


529 


Phe 


He 


Cys 


Asp 


Gly 


Gin 


Asn 


Asn 


Cys 


Gin 


Asp 


Asn 


Ser 


Asp 


Glu 


Glu 






65 










70 










75 










age 


tgt 


gaa 


agt 


tct 


caa 


get 


att 


ttt 


cca 


caa 


att 


act 


gtg 


tec 




574 


Ser 


Cys 


Glu 


Ser 


Ser 


Gin 


Ala 


He 


Phe 


Pro 


Gin 


lie 


Thr 


Val 


Ser 






80 










85 










90 















tgagccctga gctaattaag tgctggataa gcatcacctc ccagtaatcc tgttatcagc 634 
ctttgaaatg taggtagctt tattatccac attttgeaga tgaggaaaca gagtcaggtg 694 
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aagtgtcttt tccaaggcca agctcctgag ggcagg 730 

<210> 10 

<211> 107 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SIGNAL 
<222> 1. .14 

<400> 10 



Met Ser 


Ala Ser Cys 


Cys Leu Ser 


Trp 


Cys 


Pro Ala 


Lys 


Ala Lys 


Ser 




-10 






-5 






1 




Lys Cys 


Gly Pro Thr 


Phe Phe Pro 


Cys 


Ala 


Ser Gly 


lie 


His Cys 


lie 




5 


10 








15 






lie Gly 


Arg Phe Arg 


Cys Asn Gly 


Phe 


Glu 


Asp Cys 


Pro 


Asp Gly 


Ser 


20 




25 






30 








Asp Glu 


Glu Asn Cys 


Thr Ala Asn- 


Pro 


Leu 


Leu Cys 


Ser 


Thr Ala 


Arg 


35 




40 






45 






50 


Tyr His 


Cys Lys Asn 


Gly Leu Cys 


lie 


Asp 


Lys Ser 


Phe 


lie Cys 


Asp 




55 






60 






65 




Gly Gin 


Asn Asn Cys 


Gin Asp Asn 


Ser 


Aep 


Glu Glu 


Ser 


Cys Glu 


Ser 




70 




75 








80 




Ser Gin 


Ala lie Phe 


Pro Gin lie 


Thr 


Val 


Ser 









85 90 



<210> 11 

<211> 733 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> 5»UTR 
<222> 1. .253 

<220> 
<221> CDS 
<222> 254. .574 

<220> 

<221> 3»UTR 
<222> 575. .733 

<400> 11 

agatgagtgt tcagctctca gcagagaggt tagctcctct ctgcagcttg tcctgttgtc 60 

tcctcaagtc tggctgagtc cggagttttt atgagcctca gaggggagga agtgcatgct 120 

gattaatcca tgggcaggcc tggaaaagtt cccactccag tctgcgggac ccacagcctg 180 

gccctcaggc ctcaggcctt ccctggcttg aagattgggc ttcacctggg acctacccct 240 

tctgcctagg age atg tct gec tec tgc tgc ctt tea tgg tgc cca gee 289 
Met Ser Ala Ser Cys Cys Leu Ser Trp Cys Pro Ala 













-10 










-5 






aag get 


aag 


teg aaa 


tgt 


ggc 


cca 


acc 


ttc 


ttc 


ccc 


tgt 


gee age ggc 


337 


Lys Ala 


Lys 


Ser Lys 


Cys 


Gly 


Pro 


Thr 


Phe 


Phe 


Pro 


Cys 


Ala Ser Gly 




1 






5 










10 






15 




ate cat 


tgc 


ate att 


ggt 


cgc 


ttc 


egg 


tgc 


aat 


ggg 


ttt 


gag gac tgt 


385 


lie His 


Cys 


He He 


Gly 


Arg 


Phe 


Arg 


Cys 


Asn 


Gly 


Phe 


Glu Asp Cys 








20 










25 








30 




ccc gat 


ggc 


age gat 


gaa 


gag 


aac 


tgc 


aca 


gca 


aac 


cct 


ctg ctt tgc 


433 


Pro Asp 


Gly 


Ser Asp 


Glu 


Glu 


Asn 


Cys 


Thr 


Ala 


Asn 


Pro 


Leu Leu Cys 








35 








40 










45 




tec ace 


gee 


cgc tac 


cac 


tgc 


aag 


aac 


ggc 


etc 


tgt 


att 


gac aag age 


481 


Ser Thr 


Ala 


Arg Tyr 


His 


Cys 


Lys 


Asn 


Gly 


Leu 


Cys 


He 


Asp Lys Ser 
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50 55 60 

ttc ate tgegat gga cag aat aac tgt caa gac aac agt gat gag gaa 529 
Phe lie Cys Asp Gly Gin Asn Asn Cys Gin Asp Asn Ser Asp Glu Glu 

65 70 75 

age tgt gaa agt tct caa get att ttt cca caa att act gtg tec 574 
Ser Cys Glu Ser Ser Gin Ala lie Phe Pro Gin lie Thr Val Ser 
80 85 90 

tgagccctga gctaattaag tgctggataa gcatcacctc ccagtaatcc tgttatcagc 634 
ctttgaaatg taggtagctt tattatccac attttgeaga tgaggaaaca gagtcaggtg 694 
aagtgtcttt tccaaggcca agctcctgag ggcaggggc 733 

<210> 12 

<211> 107 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SIGNAL 
<222> 1. .14 

<400> 12 



Met Ser 


Ala Ser Cys 


Cys Leu Ser 


Trp 


Cys 


Pro Ala Lys Ala Lys Ser 




-10 






-5 


1 


Lys Cys 


Gly Pro Thr 


Phe Phe Pro 


Cys 


Ala 


Ser Gly He His Cys He 




5 


10 






15 


Xle Gly 


Arg Phe Arg 


Cys Asn Gly 


Phe 


Glu 


Asp Cys Pro Asp Gly Ser 


20 




25 






30 


Asp Glu 


Glu Asn Cys 


Thr Ala Asn 


Pro 


Leu 


Leu Cys Ser Thr Ala Arg 


35 




40 






45 50 


Tyr His 


Cys Lys Asn 


Gly Leu Cys 


He 


Asp 


Lys Ser Phe He Cys Asp 




55 






60 


65 


Gly Gin 


Asn Asn Cys 


Gin Asp Asn 


Ser 


Asp 


Glu Glu Ser Cys Glu Ser 




70 




75 




80 


Ser Gin 


Ala lie Phe 


Pro Gin lie 


Thr 


Val 


Ser 




85 


90 









<210> 13 
<211> 732 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> S'lTTR 
<222> 1. .253 

<220> 
<221> CDS 
<222> 254. .574 

<220> 

<221> 3'UTR 
<222> 575. .732 

<400> 13 

agatgagtgt tcagctctca gcagagaggt tagctcctct ctgcagcttg tcctgttgtc 60 
tcctcaagtc tggctgagtc cggagttttt atgagectea gaggggagga agtgcatgct 120 
gat t aat cca tgggcaggee tggaaaagtt cccactccag tetgegggae ccacagcctg 180 
gccctcaggc ctcaggcctt ccctggcttg aagattgggc ttcacctggg acctacccct 240 
tetgectagg age atg tct gee tec tgc tgc ctt tea tgg tgc cca gee 289 
Met Ser Ala Ser Cys Cys Leu Ser Trp Cys Pro Ala 
-10 -5 
aag get aag teg aaa tgt ggc cca ace ttc ttc ccc tgt gee age ggc 337 
Lys Ala Lys Ser Lys Cys Gly Pro Thr Phe Phe Pro Cys Ala Ser Gly 
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1 




5 


10 




15 




ate 


cat 


tgc ate att 


ggt cgc ttc egg 


tgc aat ggg 


ttt gag gac 


tgt 


385 


lie 


His 


Cys He He 


Gly Arg Phe Arg 


Cys Asn Gly 


Phe Glu Asp 


Cys 








20 




25 


30 






ccc 


gat 


ggc age gat 


gaa gag aac tgc 


aca gca aac 


cct ctg ctt 
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Pro 


Asp 


Gly Ser Asp 


Glu Glu Asn Cys 


Thr Ala Asn 


Pro Leu Leu 


Cys 








35 
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45 






tec 


acc 


gec cgc tac 
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att gac aag 


age 


481 


Ser 


Thr 


Ala Arg Tyr 


His Cys Lys Asn 


Gly Leu Cys 


He Asp Lys 


Ser 








50 


55 




60 






ttc 


ate 


tgc gat gga 


cag aat aac tgt 


caa gac aac 


agt gat gag 


gaa 


529 


Phe 


He 


Cys Asp Gly 


Gin Asn Asn Cys 


Gin Asp Asn 


Ser Asp Glu 


Glu 






65 




70 


75 








age 


tgt 


gaa agt tct 


caa get att ttt 


cca caa att 


act gtg tec 




574 


Ser 


Cys 


Glu Ser Ser 


Gin Ala He Phe 


Pro Gin He 


Thr Val Ser 






80 






85 


90 









tgagccctga gctaattaag tgctggataa gcatcacctc ccagtaatcc tgttatcagc 634 
ctttgaaatg taggtagctt attatccaca ttttgeagat gaggaaacag agtcaggtga 694 
agtgtctttt ccaaggccaa gctcctgagg gcaggggc 732 

<210> 14 
<211> 107 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 
<222> 1..14 



<400> 14 

Met Ser Ala Ser Cys Cys Leu Ser Trp Cys Pro Ala Lys Ala Lys Ser 

-10 -5 ^ 1 

Lys Cys Gly Pro Thr Phe Phe Pro Cys Ala Ser Gly He His Cys He 

5 10 15 

He Gly Arg Phe Arg Cys Asn Gly Phe Glu Asp Cys Pro Asp Gly Ser 

20 25 30 

Asp Glu Glu Asn Cys Thr Ala Asn Pro Leu Leu Cys Ser Thr Ala Arg 
35 40 45 50 

Tyr His Cys Lys Asn Gly Leu Cys He Asp Lys Ser Phe He Cys Asp 

55 60 65 

Gly Gin Asn Asn Cys Gin Asp Asn Ser Asp Glu Glu Ser Cys Glu Ser 

70 75 80 

Ser Gin Ala He Phe Pro Gin He Thr Val Ser 
85 90 

<210> 15 

<211> 733 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> 5'UTR 
<222> 1..253 



<220> 
<221> CDS 
<222> 254. .574 



<220> 

<221> 3'UTR 
<222> 575. .733 



<400> 15 
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agatgagtgt tcagctctca gcagagaggt tagctcctct ctgcagcttg tcctgttgtc 60 

tcctcaagtc tggctgagtc cggagttttt atgagcctca gaggggagga agtgcatgct 120 

gattaatcca tgggcaggcc tggaaaagtt cccactccag tctgcgggac ccacagcctg 180 

gccctcaggc ytcaggcctt cccaggcttg aagattgggc ttcacctggg acctacccct 240 

tctgcctagg age atg tct gec tec tgc tgc ctt tea tgg tgc cca gee 289 
Met Ser Ala Ser Cys Cys Leu Ser Trp Cys Pro Ala 
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aat ggg 
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385 


He His 
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He 


He 
20 
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Cys 
25 
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30 


Cys 
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ggc 


age 


gat 


gaa 


gag 


aac 


tgc 


aca 


gca aac 


cct 


ctg 


ctt 


tgc 


433 


Pro Asp 


Gly 


Ser 
35 


Asp 


Glu 


Glu 


Asn 


Cys 
40 


Thr 


Ala Asn 


Pro 


Leu 
45 


Leu 


Cys 




tec ace 


gee 
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tac 


cac 


tgc 


aag 


aac 


ggc 


etc tgt 


att 


gac 


aag 


age 


481 


Ser Thr 


Ala 
50 


Arg 


Tyr 


His 


Cys 


Lys 
55 


Asn 


Gly 


Leu Cys 


He 
60 


Asp 


Lys 


Ser 




ttc ate 


tgc 


gat 


gga 


cag 


aat 


aac 


tgt 


caa 


gac aac 


agt 


gat 


gag 


gaa 


529 


Phe He 


Cys 


Asp 


Gly 


Gin 


Asn 


Asn 


Cys 


Gin 


Asp Asn 


Ser 


Asp 


Glu 


Glu 




65 










70 








75 












age tgt 


gaa 


agt 


tct 


caa 


get 


att 


ttt 


cca 


caa att 


act 


gtg 


tec 




574 


Ser Cys 


Glu 


Ser 


Ser 


Gin 


Ala 


He 


Phe 


Pro 


Gin He 


Thr 


Val 


Ser 






80 








85 










90 













tgagccctga gctaattaag tgctggataa gcatcacctc ccagtaatcc tgttatcagc 634 
ctttgaaatg taggtagctt tattatccac attttgeaga tgaggaaaca gagtcaggtg 694 
aagtgtcttt tccaaggcca agctcctgag ggcaggggc 733 



<210> 16 

<211> 107 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SIGNAL 
<222> 1. .14 

<400> 16 

Met Ser Ala Ser Cys Cys Leu Ser Trp Cys Pro Ala Lys Ala Lys Ser 

-10 -5 1 

Lys Cys Gly Pro Thr Phe Phe Pro Cys Ala Ser Gly He His Cys He 

5 10 15 

He Gly Arg Phe Arg Cys Asn Gly Phe Glu Asp Cys Pro Asp Gly Ser 

20 25 30 

Asp Glu Glu Asn Cys Thr Ala Asn Pro Leu Leu Cys Ser Thr Ala Arg 
35 40 45 50 

Tyr His Cys Lys Asn Gly Leu Cys He Asp Lys Ser Phe He Cys Asp 

55 60 65 

Gly Gin Asn Asn Cys Gin Asp Asn Ser Asp Glu Glu Ser Cys Glu Ser 

70 75 80 

Ser Gin Ala He Phe Pro Gin He Thr Val Ser 
85 90 

<210> 17 

<211> 1175 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> 5»UTR 
<222> 1. .326 
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<220> 
<221> CDS 
<222> 327. .1013 

<220> 

<221> 3'UTR 
<222> 1014. .1175 

<220> 

<221> polyA_signal 
<222> 1131. .1136 

<220> 

<221> polyA_site 
<222> 1160. .1175 

<400> 17 

gaagcggagc ggtctaggga gccgcggccg cgggtcaccc ggcgggtagc agttgctgag 60 
tgtcagctag acagcagcga ctagggctcg ggcgccggcg agatgccttt gttcaccgcc 120 
aaccccttcg agcaagacgt ggtgatgcca attggtggaa aggagaaaat cacagaggaa 180 
taggactttt cccatccaat tttgtaacaa ctaatttaaa catagagact gaggcagcgg 240 
ctgtggacaa attgaatgta attgatgatg atgtggagga aattaagaaa tcagagcctg 300 
agcctgttta tatagatgag gataag atg gat aga gcc ctg cag gta ctt cag 353 

Met Asp Arg Ala Leu Gin Val Leu Gin 

1 5 

agt ata gat cca aca gat tea aaa cca gac tec caa gac ctt ttg gat 401 

Ser He Asp Pro Thr Asp Ser Lys Pro Asp Ser Gin Asp Leu Leu Asp 

10 15 20 25 

tta gaa gat ate tgc caa cag atg ggt cca atg ata gat gaa aaa ctt 449 

Leu Glu Asp He Cys Gin Gin Met Gly Pro Met He Asp Glu Lys Leu 

30 35 40 

gaa gaa att gat agg aag cat tea gaa ttg tct gaa ttg aat gtt aaa 4 97 
Glu Glu He Asp Arg Lys His Ser Glu Leu Ser Glu Leu Asn Val Lys 

45 50 55 

gtc ctg gaa get ctg gaa eta tat aac aaa ttg gtg aat gaa gca cca 545 
Val Leu Glu Ala Leu Glu Leu Tyr Asn Lys Leu Val Asn Glu Ala Pro 

60 65 70 

gtg tac tea gtc tat tea aag etc cac cct cca gca cat tac cca cct 593 
Val Tyr Ser Val Tyr Ser Lys Leu His Pro Pro Ala His Tyr Pro Pro 

75 80 85 

gca tea tct ggg gtt cca atg cag aca tat cca gtt caa tea cat ggt 641 
Ala Ser Ser Gly Val Pro Met Gin Thr Tyr Pro Val Gin Ser His Gly 
90 95 100 105 

gga aac tat atg ggt cag age att cac caa gta act gtt gcc caa age 689 
Gly Asn Tyr Met Gly Gin Ser He His Gin Val Thr Val Ala Gin Ser 

110 115 120 

tat age eta gga ccc gat caa att ggt cca ctg aga tct ctg cct cca 737 
Tyr Ser Leu Gly Pro Asp Gin He Gly Pro Leu Arg Ser Leu Pro Pro 

125 130 135 

aat gtg aat tec tea gtg aca gca cag cct get caa act tea tat tta 785 
Asn Val Asn Ser Ser Val Thr Ala Gin Pro Ala Gin Thr Ser Tyr Leu 

140 145 150 

age act gga caa gac act gtt tec aat cct act tat atg aac cag aac 833 
Ser Thr Gly Gin Asp Thr Val Ser Asn Pro Thr Tyr Met Asn Gin Asn 

155 160 165 

tct aac eta cag tea get act ggt aca act get tac aca cag caa atg 881 
Ser Asn Leu Gin Ser Ala Thr Gly Thr Thr Ala Tyr Thr Gin Gin Met 
170 175 180 185 

999 atg tct gtg gat atg tea tct tat cag aac act act tec aat ttg 929 
Gly Met Ser Val Asp Met Ser Ser Tyr Gin Asn Thr Thr Ser Asn Leu 

190 195 200 

cct caa ctg gca ggc ttt ccg gtg aca gtt cca get cat cca gtt gca 977 
Pro Gin Leu Ala Gly Phe Pro Val Thr Val Pro Ala His Pro Val Ala 
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205 210 215 

cag cag cac aca aat tac cat cag cag cct etc ctt tagaaacaaa 1023 
Gin Gin His Thr Asn Tyr His Gin Gin Pro Leu Leu 

220 225 
tcaagcattt tcttgaaagc cttcataagt gtattattca gtccttgtga taccaacctg 1083 
aaaatattaa aacttttttc cctctcaact caaaaggacc atgaataaat aaagcacaaa 1143 
aacctctctt attctgaaaa aaaaaaaaaa at 1175 

<210> 18 

<211> 229 

<212> PRT 

<213> Homo sapiens 

<400> 18 

Met Asp Arg Ala Leu Gin Val Leu Gin Ser lie Asp Pro Thr Asp Ser 

1 5 10 15 

LyB Pro Asp Ser Gin Asp Leu Leu Asp Leu Glu Asp He Cys Gin Gin 

20 25 30 

Met Gly Pro Met He Asp Glu Lys Leu Glu Glu He Asp Arg Lys His 

35 40 45 

Ser Glu Leu Ser Glu Leu Asn Val Lys Val Leu Glu Ala Leu Glu Leu 

50 55 60 

Tyr Asn Lys Leu Val Asn Glu Ala Pro Val Tyr Ser Val Tyr Ser Lys 
65 70 75 80 

Leu His Pro Pro Ala His Tyr Pro Pro Ala Ser Ser Gly Val Pro Met 

85 90 95 

Gin Thr Tyr Pro Val Gin Ser His Gly Gly Asn Tyr Met Gly Gin Ser 

100 105 110 

He His Gin Val Thr Val Ala Gin Ser Tyr Ser Leu Gly Pro Asp Gin 

115 120 125 

He Gly Pro Leu Arg Ser Leu Pro Pro Asn Val Asn Ser Ser Val Thr 

13 0 135 140 

Ala Gin Pro Ala Gin Thr Ser Tyr Leu Ser Thr Gly Gin Asp Thr Val 
145 150 155 " 160 

Ser Asn Pro Thr Tyr Met Asn Gin Asn Ser Asn Leu Gin Ser Ala Thr 

165 170 175 

Gly Thr Thr Ala Tyr Thr Gin Gin Met Gly Met Ser Val Asp Met Ser 

180 185 190 

Ser Tyr Gin Asn Thr Thr Ser Asn Leu Pro Gin Leu Ala Gly Phe Pro 

195 200 205 

Val Thr Val Pro Ala His Pro Val Ala Gin Gin His Thr Asn Tyr His 

210 215 220 

Gin Gin Pro Leu Leu 
225 

<210> 19 

<211> 844 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> 5'UTR 
<222> 1..111 

<220> 
<221> CDS 
<222> 112. .813 

<220> 

<221> 3'UTR 
<222> 814. .844 

<400> 19 
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tttcctgttg cctgtctcta aacccctcca cattcccgcg gtccttcaga ctgcccggag 60 
agcgcgctct gcctgccgcc tgcctgcctg ccactgaggg ttcccagcac c atg agg 117 

Met Arg 
-15 

gcc tgg ate ttc ttt etc ctt tgc ctg gec ggg agg gec ttg gca gec 165 
Ala Trp He Phe Phe Leu Leu Cys Leu Ala Gly Arg Ala Leu Ala Ala 

-10 -5 1 

cct cag caa gaa gcc ctg cct gat gag aca gag gtg gtg gaa gaa act 213 
Pro Gin Gin Glu Ala Leu Pro Asp Glu Thr Glu Val Val Glu Glu Thr 

5 10 15 

gtg gca gag gtg act gag gta tct gtt gga get aat cct gtc cag gtg 261 
Val Ala Glu Val Thr Glu Val Ser Val Gly Ala Asn Pro Val Gin Val 

20 25 30 

gaa gta gga gaa ttt gat gat ggt gca gag gaa ace gaa gag gag gtg 309 
Glu Val Gly Glu Phe Asp Asp Gly Ala Glu Glu Thr Glu Glu Glu Val 
35 40 45 50 

gtg gcg gaa aat ccc tgc cag aac cac cac tgc aaa cac ggc aag gtg 357 
Val Ala Glu Asn Pro Cys Gin Asn His His Cys Lys His Gly Lys Val 

55 60 65 

tgc gag ctg gat gag aac aac acc ccc atg tgc gtg tgc cag gac ccc 405 
Cys Glu Leu Asp Glu Asn Asn Thr Pro Met Cys Val Cys Gin Asp Pro 

70 75 80 

acc age tgc cca gcc ccc att ggc gag ttt gag aag gtg tgc age aat 453 
Thr Ser Cys Pro Ala Pro He Gly Glu Phe Glu Lys Val Cys Ser Asn 

85 90 95 

gac aac aag acc ttc gac tct tec tgc cac ttc ttt gcc aca aag tgc 501 
Asp Asn Lys Thr Phe Asp Ser Ser Cys His Phe Phe Ala Thr Lys Cys 

100 105 110 

acc ctg gag ggc acc aag aag ggc cac aag etc cac ctg gac tac ate 549 
Thr Leu Glu Gly Thr Lys Lys Gly His Lys Leu His Leu Asp Tyr He 
115 120 125 130 

ggg cct tgc aaa tac ate ccc cct tgc ctg gac tct gag ctg acc gaa 597 
Gly Pro Cys Lys Tyr He Pro Pro Cys Leu Asp Ser Glu Leu Thr Glu 

135 14 0 145 

ttc ccc ctg cgc atg egg gac tgg etc aag aac gtc ctg gtc acc ctg 645 
Phe Pro Leu Arg Met Arg Asp Trp Leu Lys Asn Val Leu Val Thr Leu 

150 155 160 

tat gag agg gat gag gac aac aac ctt ctg act gag aag cag aag ctg 693 
Tyr Glu Arg Asp Glu Asp Asn Asn Leu Leu Thr Glu Lys Gin Lys Leu 

165 170 175 

egg gtg aag aag ate cat gag aat gag aag cgc ctg gag gca gga gac 741 
Arg Val Lys Lys He His Glu Asn Glu Lys Arg Leu Glu Ala Gly Asp 

180 185 190 

cac ccc gtg gag ctg ctg gcc egg gac tgc cag get gtt tea gcc agg 789 
His Pro Val Glu Leu Leu Ala Arg Asp Cys Gin Ala Val Ser Ala Arg 
195 200 205 210 

aag gcc aaa ate aag agt gag atg tagaaagttg taaaatagaa aaagtggagt 843 
Lys Ala Lys He Lys Ser Glu Met 

215 

t 844 

<210> 20 
<211> 234 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> 1. .17 

<400> 20 

Met Arg Ala Trp He Phe Phe Leu Leu Cys Leu Ala Gly Arg Ala Leu 
-15 -io -5 
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Ser 
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His 
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100 
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His 


Lys 
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His 


Leu 
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120 
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Pro 


Pro 


Cys 
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Glu 


Leu 




130 






135 










140 








Thr Glu 
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Leu Arg Met 


Arcj 


Asp 
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Leu Lys 
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Val 


Leu 


Val 


145 






150 










155 










Thr Leu 
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Glu 


Arg Asp Glu 


Asp 


Asn 


Asn 


Leu 


Leu 


Thr 


Glu 


Lys 


Gin 


160 






165 








170 










175 


Lys Leu 


Arg 


Val 


Lys Lys lie 


His 


Glu 


Asn 


Glu Lys 


Arg 


Leu 


Glu 


Ala 








180 






185 










190 




Gly Asp 


His 


Pro 


Val Glu Leu 


Leu 


Ala 


Arg 


Asp 


Cys 


Gin 


Ala 


Val 
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195 






200 










205 






Ala Arg 


Lys 


Ala 


Lys He Lys 


Ser 


Glu 


Met 















210 215 



<210> 21 

<211> 1997 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> 5 'UTR 
<222> 1. .126 

<220> 
<221> CDS 
<222> 127. .1020 

<220> 

<221> 3 'UTR 
<222> 1021. .1997 

<400> 21 

atcctctaag cttttaaata ttgcttcgat ggtctgaatt tttatttcca gggaaaaaga 60 
gagttttgtc ccacagtcag caggccacta gtttattaac ttccagtcac cttgattttt 120 
gctaaa atg aag act ctg cag tct aca ctt etc ctg tta ctg ctt gtg 168 
Met Lys Thr Leu Gin Ser Thr Leu Leu Leu Leu Leu Leu Val 









-15 








-10 










-5 




cct 


ctg ata 


aag 


cca gca 


cca 


cca acc 


cag 


cag 


gac 


tea 


cgc 


att 


ate 


216 


Pro 


Leu He 


Lys 


Pro Ala 


Pro 


Pro Thr 


Gin 


Gin 


Asp 


Ser 


Arg 


He 


He 










1 
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10 








tat 


gat tat 
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aca gat 
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gaa 


tec 


ata 


ttt 


age 


caa 


gat 


264 


Tyr 


Asp Tyr 


Gly 


Thr Asp 


Asn 


Phe Glu 


Glu 


Ser 


He 


Phe 


Ser 


Gin 


Asp 






15 








20 








25 










tat 


gag gat 


aaa 


tac ctg 


gat 


gga aaa 


aat 


att 


aag 


gaa 


aaa 


gaa 


act 


312 


Tyr 


Glu Asp 


Lys 


Tyr Leu 


Asp 


Gly Lys 


Asn 


He 


Lys 


Glu 


Lys 


Glu 


Thr 






30 






35 








40 












gtg 


ata ata 


ccc 


aat gag 


aaa 


agt ctt 


caa 


tta 


caa 


aaa 


gat 


gag 


gca 


360 


Val 


He He 


Pro 


Asn Glu 


Lys 


Ser Leu 


Gin 


Leu 


Gin 


Lys 


Asp 


Glu 


Ala 
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45 










50 










55 








60 




ata 


aca 


cca 


tta 


cct 


ccc 


aag 


aaa 


gaa 


aat 


gat gaa 


atg 


ccc 


acg 


tgt 


408 


lie 


Thr 


Pro 


Leu 


Pro 
65 


Pro 


Lys 


Lys 


Glu 


Asn 
70 


Asp Glu 


Met 


Pro 


Thr 
75 


Cys 




ctg 


ctg 


tgt 


gtt 


tgt 


tta 


agt 


ggc 


tct 


gta 


tac tgt 


gaa 


gaa 


gtt gac 


456 


Leu 


Leu 


Cys 


Val 
80 


Cys 


Leu 


Ser 


Gly 


Ser 
85 


Val 


Tyr Cys 


Glu 


Glu 
90 


Val 


Asp 




att 


gat 


get 


gta 


cca 


ccc 


tta 


cca 


aag 


gaa 


tea gee 


tat 


ctt 


tac 


gca 


504 


He Asp 


Ala 


Val 


Pro 


Pro 


Leu 


Pro 


Lys 


Glu 


Ser Ala 


Tyr 


Leu 


Tyr Ala 








95 










100 








105 










cga 


ttc 


aac 


aaa 


att 


aaa 


aag 


ctg 


act 


gee 


aaa gat 


ttt 


gca 


gac 


ata 


552 


Arg Phe 


Asn 


Lys 


He 


Lys 


Lys 


Leu 


Thr 


Ala 


Lys Asp 


Phe 


Ala 


Asp 


He 






110 










115 








120 












cct 


aac 


tta 


aga 


aga 


etc 


gat 


ttt 


aca 


gga 


aat ttg 


ata 


gaa 


gat 


ata 


600 


Pro 


Asn 


Leu 


Arg 


Arg 


Leu 


Asp 


Phe 


Thr 


Gly 


Asn Leu 


He 


Glu 


Asp 


He 




125 










130 










135 








140 . 




gaa 


gat 


ggt 


act 


ttt 


tea 


aaa 


ctt 


tct 


ctg 


tta gaa 


gaa 


ctt 


tea 


ctt 


648 


Glu Asp 


Gly 


Thr 


Phe 


Ser 


Lys 


Leu 


Ser 


Leu 


Leu Glu 


Glu 


Leu 


Ser 


Leu 












145 










150 








155 






get 


gaa 


aat 


caa 


eta 


eta 


aaa 


ctt 


cca 


gtt 


ctt cct 


ccc 


aag 


etc 


act 


696 


Ala 


Glu 


Asn 


Gin 
160 


Leu 


Leu 


Lys 


Leu 


Pro 
165 


Val 


Leu Pro 


Pro 


Lys 
170 


Leu 


Thr 




tta 


ttt 


aat 


gca 


aaa 


tac 


aac 


aaa 


ate 


aag 


agt agg 


gga 


ate 


aaa 


gca 


744 


Leu 


Phe 


Asn 


Ala 


Lys 


Tyr 


Asn 


Lys 


He 


Lys 


Ser Arg 


Gly 


He 


Lys Ala 








175 










180 








185 










aat 


gca 


ttc 


aaa 


aaa 


ctg 


aat 


aac 


etc 


acc 


ttc etc 


tac 


ttg 


gac 


cat 


792 


Asn 


Ala 


Phe 


Lys 


Lys 


Leu 


Asn 


Asn 


Leu 


Thr 


Phe Leu 


Tyr 


Leu 


Asp His 






190 










195 








200 












aat 


gec 


ctg 


gaa 


tec 


gtg 


cct 


ctt 


aat 


tta 


cca gaa 


agt 


eta 


cgt gta 


840 


Asn 


Ala 


Leu 


Glu 


Ser 


Val 


Pro 


Leu 


Asn 


Leu 


Pro Glu 


Ser 


Leu 


Arg Val 




205 










210 










215 








220 




att 


cat 


ctt 


cag 


ttc 


aac 


aac 


ata 


get 


tea 


att aca 


gat 


gac 


aca 


ttc 


888 


He 


His 


Leu 


Gin 


Phe 
225 


Asn 


Asn 


He 


Ala 


Ser 
230 


He Thr 


Asp 


Asp 


Thr 
235 


Phe 




tgc 


aag 


get 


aat 


gac 


acc 


agt 


tac 


ate 


egg 


gac cgc 


att 


gaa 


gag 


ata 


936 


Cys 


Lys 


Ala 


Asn 
240 


Asp 


Thr 


Ser 


Tyr 


He 
245 


Arg 


Asp Arg 


He 


Glu 
250 


Glu 


He 




cgc 


ctg 


gag 


ggc 


aat 


cca 


ate 


gtc 


ctg 


gga 


aag cat 


cca 


aac 


agt 


ttt 


984 


Arg 


Leu 


Glu 
255 


Gly 


Asn 


Pro 


He 


Val 
260 


Leu 


Gly 


Lys His 


Pro 
265 


Asn 


Ser 


Phe 




att 


tgc 


tta 


aaa 


aga 


tta 


ccg 


ata 


ggg 


tea 


tac ttt 


taacctctat 




1030 


He 


Cys 


Leu 


Lys 


Arg 


Leu 


Pro 


He 


Gly 


Ser 


Tyr Phe 













270 275 280 



tggtacaaca tataaatgaa agtacaccta cactaatagt ctgtctcaac aatgagtaaa 1090 
ggaacttaag tattggttta atattaacct tgtatctcat tttgaaggaa tttaatattt 1150 
taagcaagga tgttcaaaat cttacatata ataagtaaaa agtaagactg aatgtctacg 1210 
ttcgaaacaa agtaatatga aaatatttaa acagcattac aaaatcctag tttatactag 1270 
actaccattt aaaaatcatg tttttatata aatgcccaaa tttgagatgc attattccta 1330 
ttactaatga tgtaagtacg aggataaatc caagaaactt tcaactcttt gcctttcctg 1390 
gectttactg gatcccaaaa gcatttaagg tacatgttcc aaaaactttg aaaagctaaa 1450 
tgtttcccat gategctcat tcttctttta tgattcatac gttattcctt ataaagtaag 1510 
aactttgttt tcctcctatc aaggcagcta ttttattaaa tttttcactt agtctgagaa 1570. 
atagcagata gtctcatatt taggaaaact ttccaaataa aataaatgtt attctctgat 1630 
aaagagctaa tacagaaatg ttcaagttat tttactttct ggtaatgtct tcagtaaaat 1690 
attttcttta tctaaatatt aacattctaa gtctaccaaa aaaagtttta aactcaagca 1750 
ggccaaaacc aatatgetta taagaaataa tgaaaagttc atccatttct gataaagttc 1810 
tctatggcaa agtctt'tcaa atacgagata actgeaaaat attttccttt tatactacag 1870 
aaatgagaat ctcatcaata aattagttca agcataagat gaaaacagaa tattctgtgg 1930 
tgccagtgca cactaccttc ccacccatac acatccatgt tcactgtaac aaactgaata 1990 
ttcacaa 1997 

<210> 22 
<211> 298 
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<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> 1. .19 

<400> 22 

Met Lys Thr Leu Gin Ser Thr Leu Leu Leu Leu Leu Leu Val Pro Leu 

-15 -10 -5 

lie Lys Pro Ala Pro Pro Thr Gin Gin Asp Ser Arg He He Tyr Asp 

1 5 10 

Tyr Gly Thr Asp Asn Phe Glu Glu Ser He Phe Ser Gin Asp Tyr Glu 

15 20 25 

Asp Lys Tyr Leu Asp Gly Lys Asn He Lys Glu Lys Glu Thr Val He 
30 35 40 45 

He Pro Asn Glu Lys Ser Leu Gin Leu Gin Lys Asp Glu Ala He Thr 

50 55 60 

Pro Leu Pro Pro Lys Lys Glu Asn Asp Glu Met Pro Thr Cys Leu Leu 
s 65 70 75 

. Cys Val Cys Leu Ser Gly Ser Val Tyr Cys Glu Glu Val Asp He Asp 
80 85 90 

Ala Val Pro Pro Leu Pro Lys Glu Ser Ala Tyr Leu Tyr Ala Arg Phe 

95 100 105 

Asn Lys He Lys Lys Leu Thr Ala Lys Asp Phe Ala Asp He Pro Asn 
. 110 115 120 125 

Leu Arg Arg Leu Asp Phe Thr Gly Asn Leu He Glu Asp He Glu Asp 

130 135 140 

Gly Thr Phe Ser Lys Leu Ser Leu Leu Glu Glu Leu Ser Leu Ala Glu 

145 150 155 

Asn Gin Leu Leu Lys Leu Pro Val Leu Pro Pro Lys Leu Thr Leu Phe 

160 165 170 

Asn Ala Lys Tyr Asn Lys He Lys Ser Arg Gly He Lys Ala Asn Ala 

175 180 185 

Phe Lys Lys Leu Asn Asn Leu Thr Phe Leu Tyr Leu Asp His Asn Ala 
190 195 200 205 

Leu Glu Ser Val Pro Leu Asn Leu Pro Glu Ser Leu Arg Val He His 

210 215 220 

Leu Gin Phe Asn Asn He Ala Ser He Thr Asp Asp Thr Phe Cys Lys 

225 230 235 

Ala Asn Asp Thr Ser Tyr He Arg Asp Arg He Glu Glu He Arg Leu 

240 245 250 

Glu Gly Asn Pro He Val Leu Gly Lys His Pro Asn Ser Phe He Cys 

255 260 265 

Leu Lys Arg Leu Pro He Gly Ser Tyr Phe 
270 275 

, <210> 23 
<211> 1746 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> S'UTR 
<222> 1. .9 

<220> 
. <221> CDS 

<222> 10. .1212 

<220> 

<221> 3 ! UTR 
<222> 1213. .1746 
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<220> 

<221> polyA_signal 
<222> 1709. .1714 

<220> 

<221> polyA_site 
<222> 1733. .1746 

<400> 23 

gcctcacca atg gtt ccc ttc ate tat ctg caa gec cac ttt aca etc tgt 51 
Met Val Pro Phe lie Tyr Leu Gin Ala His Phe Thr Leu Cys 
-15 -10 -5 



tct 


ggg 


tgg 


tec 


age 


aca 


tac 


egg 


gac 


etc 


egg 


aag 


ggt 


gtg 


tat 


gtg 


99 


Ser 


Gly 


Trp 
1 


Ser 


Ser 


Thr 


Tyr 
5 


Arg 


Asp 


Leu 


Arg 


Lys 
10 


Gly 


Val 


Tyr 


Val 




ccc 


tac 


ace 


cag 


ggc 


aag 


tgg 


gaa 


ggg 


gag 


ctg 


ggc 


acc 


gac 


ctg 


gta 


147 


Pro 


Tyr 


Thr 


Gin 


Gly 


Lys 


Trp 


Glu 


Gly 


Glu 


Leu 


Gly 


Thr 


Asp 


Leu 


Val 




15 










20 










25 










30 




age 


ate 


ccc 


cat 


ggc 


ccc 


aac 


gtc 


act 


gtg 


cgt 


gee 


aac 


att 


get 


gec 


195 


Ser 


lie 


Pro 


His 


Gly 


Pro 


Asn 


Val 


Thr 


Val 


Arg 


Ala 


Asn 


He 


Ala 


Ala 












35 










40 










45 






ate 


act 


gaa 


tea 


gac 


aag 


ttc 


ttc 


ate 


aac 


ggc 


tec 


aac 


tgg 


gaa 


ggc 


243 


lie 


Thr 


Glu 


Ser 


Asp 


Lys 


Phe 


Phe 


He 


Asn 


Gly 


Ser 


Asn 


Trp 


Glu 


Gly 










50 










55 










60 








ate 


ctg 


ggg 


ctg 


gee 


tat 


get 


gag 


att 


gee 


agg 


cct 


gac 


gac 


tec 


ccg 


291 


lie 


Leu 


Gly 


Leu 


Ala 


Tyr 


Ala 


Glu 


He 


Ala 


Arg 


Pro 


Asp 


Asp 


Ser 


Pro 








65 










70 










75 










gag 


cct 


ttc 


ttt 


gac 


tct 


ctg 


gta 


aag 


cag 


acc 


cac 


gtt 


ccc 


aac 


etc 


339 


Glu 


Pro 


Phe 


Phe 


Asp 


Ser 


Leu 


Val 


Lys 


Gin 


Thr 


His 


Val 


Pro 


Asn 


Leu 






80 










85 










90 












ttc 


tec 


ctg 


cag 


ctt 


tgt 


ggt 


get 


ggc 


ttc 


ccc 


etc 


aac 


cag 


tct 


gaa 


387 


Phe 


Ser 


Leu 


Gin 


Leu 


Cys 


Gly 


Ala 


Gly 


Phe 


Pro 


Leu 


Asn 


Gin 


Ser 


Glu 




95 










100 










105 










110 




gtg 


ctg 


gee 


tct 


gtc 


gga 


ggg 


age 


atg 


ate 


att 


gga 


ggt 


ate 


gac 


cac 


435 


Val 


Leu 


Ala 


Ser 


Val 


Gly 


Gly 


Ser 


Met 


He 


He 


Gly 


Gly 


He 


Asp 


His 












115 










120 










125 






teg 


ctg 


tac 


aca 


ggc 


agt 


etc 


tgg 


tat 


aca 


ccc 


ate 


egg 


egg 


gag 


tgg 


483 


ser 


Leu 


Tyr 


Thr 


(jly 


Ser 


Leu 


Trp 


Tyr 


Thr 


Pro 


. He 


Arg 


Arg 


Glu 


Trp 










130 










135 










140 








tat 


tat 


gag 


gtg 


ate 


att 


gtg 


egg 


gtg 


gag 


ate 


aat 


gga 


cag 


gat 


ctg 


531 


Tyr 


Tyr 


Glu 


Val 


He 


He 


Val 


Arg 


Val 


Glu 


He 


Asn 


Gly 


Gin 


Asp 


Leu 








TAR 




















1 r r 

155 










aaa 


atg 


gac 


tgc 


aag 


gag 


tac 


aac 


tat 


gac 


aag 


age 


att 


gtg 


gac 


agt 


579 


Lys 


Met 


Asp 


Cys 


Lys 


Glu 


Tyr 


Asn 


Tyr 


Asp 


Lys 


Ser 


He 


Val 


Asp 


Ser 






160 










165 










170 












ggc 


ace 


ace 


aac 


ctt 


cgt 


ttg 


ccc 


aag 


aaa 


gtg 


ttt 


gaa 


get 


gca 


gtc 


627 


Gly 


Thr 


Thr 


Asn 


Leu 


Arg 


Leu 


Pro 


Lys 


Lys 


Val 


Phe 


Glu 


Ala 


Ala 


Val 




175 










180 










185 










190 




aaa 


tec 


ate 


aag 


gca 


gec 


tec 


tec 


acg 


gag 


aag 


ttc 


cct 


gac 


ggt 


ttc 


675 


Lys 


Ser 


He 


Lys 


Ala 


Ala 


Ser 


Ser 


Thr 


Glu 


Lys 


Phe 


Pro 


Asp 


Gly 


Phe 












195 










200 










205 






tgg 


eta 


gga 


gag 


cag 


ctg 


gtg 


tgc 


tgg 


caa 


gca 


ggc 


acc 


acc 


cct 


tgg 


723 


Trp 


Leu 


Gly 


Glu 


Gin 


Leu 


Val 


Cys 


Trp 


Gin 


Ala 


Gly 


Thr 


Thr 


Pro 


Trp 










210 










215 










220 








aac 


att 


ttc 


cca 


gtc 


ate 


tea 


etc 


tac 


eta 


atg 


ggt 


gag 


gtt 


acc 


aac 


771 


As n 


lie 


Phe 


Pro 


Val 


He 


Ser 


Leu 


Tyr 


Leu 


Met 


Gly 


Glu 


Val 


Thr 


Asn 








225 










230 










235 










cag 


tec 


ttc 


cgc 


ate 


acc 


ate 


Ctt 


ccg 


cag 


caa 


tac 


ctg 


egg 


cca 


gtg 


819 


Gin 


Ser 


Phe 


Arg 


He 


Thr 


He 


Leu 


Pro 


Gin 


Gin 


Tyr 


Leu 


Arg 


Pro 


Val 






240 










245 










250 










gaa 


gat 


gtg 


gec 


acg 


tec 


caa 


gac 


gac 


tgt 


tac 


aag 


ttt 


gee 


ate 


tea 


867 


Glu 


Asp 


Val 


Ala 


Thr 


Ser 


Gin 


Asp 


Asp 


Cys 


Tyr 


Lys 


Phe 


Ala 


He 


Ser 
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255 260 265 270 

cag tea tec acg ggc act gtt atg gga get gtt ate atg gag ggc ttc 915 

Gin Ser Ser Thr Gly Thr Val Met Gly Ala Val lie Met Glu Gly Phe 

275 280 285 

tac gtt gtc ttt gat egg gee cga aaa cga att ggc ttt get gtc age 963 
Tyr Val Val Phe Asp Arg Ala Arg Lys Arg lie Gly Phe Ala Val Ser 

290 295 300 

get tgc cat gtg cac gat gag ttc agg acg gca gcg gtg gaa ggc cct 1011 
Ala Cys His Val His Asp Glu Phe Arg Thr Ala Ala Val Glu Gly Pro 

305 310 315 

ttt gtc ace ttg gac atg gaa gac tgt ggc tac aac att cca cag aca 1059 
Phe Val Thr Leu Asp Met Glu Asp Cys Gly Tyr Asn lie Pro Gin Thr 

320 325 330 

gat gag tea acc etc atg ace ata gee tat gtc atg get gee ate tgc 1107 
Asp Glu Ser Thr Leu Met Thr lie Ala Tyr Val Met Ala Ala lie Cys 
335 340 345 350 

gee etc ttc atg ctg cca etc tgc etc atg gtg tgt cag tgg cgc tgc 1155 
Ala Leu Phe Met Leu Pro Leu Cys Leu Met Val Cys Gin Trp Arg Cys 

355 360 365 

etc cgc tgc ctg cgc cag cag cat gat gac ttt get gat gac ate tec 1203 
Leu Arg Cys Leu Arg Gin Gin His Asp Asp Phe Ala Asp Asp lie Ser 

370 375 380 

ctg ctg aag tgaggaggee catgggcaga agatagggat tcccctggac 1252 
Leu Leu Lys 
385 

cacacctccg tggttcactt tggtcacaag taggagacac agatggcacc tgtggccaga 1312 
gcacctcagg accctcccca cccaccaaat gcctctgcct tgatggagaa ggaaaaggct 1372 
ggcaaggtgg gttccaggga ctgtacctgt aggagacaga aaagagaaga aagaagcact 1432 
ctgctggcgg gaatactctt ggtcacctca aatttaagtc gggaaattct getgettgaa 1492 
acttcagccc tgaacctttg tcaccattcc tttaaattct ccaacccaaa gtattcttct 1552 
tttcttagtt tcagaagtac tggcatcaca cgcaggttac cttggcgtgt gtccctgtgg 1612 
taccctggca gagaagagac caagcttgtt tccctgctgg ccaaagtcag taggagagga 1672 
tgcacagttt getatttget ttagagacag ggactgtata aacaagecta acattggtgc 1732 
aaaaaaaaaa aaaa 1746 

<210> 24 
<211> 401 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 
<222> 1. .17 



<400> 24 



Met 


Val 


Pro 
-15 


Phe 


He 


Tyr 


Leu 


Gin 
-10 


Ala 


His 


Phe 


Thr 


Leu 
-5 


Cys 


Ser 


Gly 


Trp 


Ser 
1 


Ser 


Thr 


Tyr 


Arg 
5 


Asp 


Leu 


Arg 


Lys 


Gly 
10 


Val 


Tyr 


Val 


Pro 


Tyr 
15 


Thr 


Gin 


Gly 


Lys 


Trp 
20 


Glu 


Gly 


Glu 


Leu 


Gly 
25 


Thr 


Asp 


Leu 


Val 


Ser 
30 


He 


Pro 


His 


Gly 


Pro 
35 


Asn 


Val 


Thr 


Val 


Arg 
40 


Ala 


Asn 


He 


Ala 


Ala 
45 


He 


Thr 


Glu 


Ser 


Asp 
50 


Lys 


Phe 


Phe 


He 


Asn 
55 


Gly 


Ser 


Asn 


Trp 


Glu 
60 


Gly 


He 


Leu 


Gly 


Leu 
65 


Ala 


Tyr 


Ala 


Glu 


He 
70 


Ala 


Arg 


Pro 


Asp 


Asp 
75 


Ser 


Pro 


Glu 


Pro 


Phe 


Phe 


Asp 


Ser 


Leu 


Val 


Lys 


Gin 


Thr 


His 


Val 


Pro 


Asn 


Leu 


Phe 


Ser 


80 










85 










90 










95 


Leu 


Gin 


Leu 


Cys 


Gly 
100 


Ala 


Gly 


Phe 


Pro 


Leu 
105 


Asn 


Gin 


Ser 


Glu 


Val 
110 


Leu 


Ala 


Ser 


Val 


Gly 
115 


Gly 


Ser 


Met 


He 


He 
120 


Gly 


Gly 


He 


Asp 


His 
125 


Ser 


Leu 



20 



WO 02/094864 



PCT/IB01/01715 



A y* 


XXXI 


yjj. y 
130 




Leu Trp 


J. jr 1 X XXX 

135 


Pro 

XTX w 


He 


Arcr 


A "TCI 


Glu 
140 


ixxy 


xyx 


Tvr 
xyr 


Glu 


Val 
145 


lie 


lie 


Val Arg 


Val Glu 
150 


He 


Asn 


Glv 


Gin 
155 


Asd 


Leu 


Lys 


Met 


Asp 


Cys 


Lys 


Glu 


Tyr Asn 


Tyr Asp 


Lys 


Ser 


He 


Val 


Asp 


Ser 


Glv 


Thr 


160 








165 








170 










175 


Thr 


As 11 


Leu 


Arcr 


Leu Pro 
180 


TjVS IiVS 


Val 


Phe 
185 


Glu 


Ala 


Ala 


Val 


Lys 
190 


Ser 


lie 


Lys 


Ala 


Ala 
195 


Ser Ser 


Thr Glu 


Lys 
200 


Phe 


Pro 


Asp 


Glv 


Phe 
205 


XI }J 


Leu 


Glv 


Glu 


Gin 
210 


Leu 


Val Cys 


Trp Gin 
215 


Ala 


Glv 


Thr 


Thr 


Pro 
220 


±J -h f 


Asn 


He 


Phe 


225 


Val 


lie 


Ser Leu 


Tyr Leu 
230 


Met 


Glv 


Glu 


Val 
235 


Thr 


Asn 


Gin 


Ser 


Phe 

rile 


Arg 


Tl « 

11C 


Thr 

■L 11 X 


lie ueu 


riu OX11 


Gin 


T\/r 
xyx 


ucu 


Am 


Pro 


Val 


Glu 


A on 


240 








245 








250 










255 


Val 

val 


Al a 

Aid 


Thr 

X1XX 


oci 


will rlo }J 

260 


7\ an rHrQ 

/to i*j uy » 


ryr 


T ,\/ C! 

j-iy o 
265 


Phe 


Al a 

AlO 


Tl e 

11C 


Coy* 

OCX 


PSl n 

will 

270 


Car 
OCX 








X 11X 

275 


Val Mer 

V d X 1'IC u 


uiy Aid 


Val 
val 

280 


Tl e 
lie 


Mpf- 


Ul u 


m v 


Phe 
it lie 

285 


xyxr 


Val 
v dx 


Val 

v cii 




Act* 
290 


At* ft 


Al a ArfT 
xiia ax y 


XJ_y s ax y 

295 


Tie 

11C 


Ol v 
uiy 


Phe 


Al a 
Aid 


Val 
300 


OC1 


Ala 
Aid 






vai 

305 


ni o 


Asp 


OX LI xr lie: 


& TYT T Vl T* 

ai y x xxx 

310 


Al a 
Ala 


Al a 
Aid 


Val 
v ai 


(31 11 

UlU 

315 


fll V 

uiy 


Pro 


Jtrllt; 


Val 
vdi 


X11X 


Leu 


Asp 




ulU ABp 


vjys vjj.y 


Tyr 


Asn 


Tl *a 


Pro 


r»1 -n 

Ulll 


1 111 


Asp 


Vjl Li 


320 








325 








330 










335 


Ser 


Thr 


Leu 


Met 


Thr He 
340 


Ala Tyr 


Val 


Met 
345 


Ala 


Ala 


He 


Cys 


Ala 
350 


Leu 


Phe 


Met 


Leu 


Pro 
355 


Leu Cys 


Leu Met 


Val 
360 


Cys 


Gin 


Trp 


Arg 


Cys 
365 


Leu 


Arg 


Cys 


Leu 


Arg 
370 


Gin 


Gin His 


Asp Asp 
375 


Phe 


Ala 


Asp 


Asp 


He 
380 


Ser 


Leu 


Leu 



Lys 

<210> 25 
<211> 1239 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> 5'UTR 
<222> 1..126 

<220> 
<221> CDS 
<222> 127. .879 

<220> 

<221> 3 , UTR 
<222> 880. .1239 

<220> 

<221> polyA_site 
<222> 1224. .1239 

<400> 25 

agtctaggat cctcacacca getacttgea agggagaagg aaaaggccag taaggcctgg 60 
gecaggagag tcccgacagg agtgtcaggt ttcaatctca gcaccagcca ctcagagcag 120 
ggcacg atg ttg ggg gec cgc etc agg etc tgg gtc tgt gec ttg tgc 168 
Met Leu Gly Ala Arg Leu Arg Leu Trp Val Cys Ala Leu Cys 
-20 -15 -10 

age gtc tgc age atg age gtc etc aga gee tat ccc aat gec tec cca 216 
Ser Val Cys Ser Met Ser Val Leu Arg Ala Tyr Pro Asn Ala Ser Pro 
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-5 










1 








5 








Ctg etc 


y y 


tec 


age 


taa 
*-yy 


□at 
y y u 


aac 
yy*- 


eta 


ate 


cac 


ctg 


tac 


aca 


CfCC 


aca 


264 


Leu Leu 


Glv 
10 


Ser 


Ser 


Tro 


Glv 


Glv 
15 


Leu 


lie 


His 


Leu 


Tvr 
20 


Thr 


Ala 


Thr 




gec agg 


aac 


age 


tac 


cac 


ctg 


cag 


ate 


cac 


aag 


aat 


ggc 


cat 


gtg 


gat 


312 


Ala Arg 


Asn 


Ser 


Tvr 


His 


Leu 


Gin 


lie 


His 


Lys Asn 


Glv 


His 


Val 


Asp 




25 










30 










35 












ggc gca 


ccc 


cat 


cag 


acc 


ate 


tac 


agt 


gee 


ctg atg 


ate 


aga 


tea 


gag 


360 


Gly Ala 


Pro 


His 


Gin 


Thr 


lie 


Tyr 


Ser 


Ala 


Leu 


Met 


lie 


Arq 


Ser 


Glu 




40 








45 










50 










55 




gat get 


qqc 


ttt 


Qtq 


qtq 


att 


aca 


pert 




atg 


age 


aqa 


aqa 


tac 


etc 


408 


Asp Ala 


Glv 


Phe 


Val 
60 


Val 


lie 


Thr 


Glv 


Val 
65 


Met 


Ser 


Ara 

^•3 


Ara 


Tvr 
70 


Leu 




tQC atQ 


gat 


ttc 


aqa 


aac 
yy 


aac 


att 


ttt 


qaa 


tea 


cac 


tat 


ttc 


gac 


ccg 


456 


Cys Met 


Asp 


Phe 
75 


Ara 


Glv 


Asn 


lie 


Phe 
80 


Glv 


Ser 


His 


Tvr 


Phe 
85 


Asp 


Pro 




QAn aac 






ttc 


caa 


cac 


cag 


acg 


ctg 


gaa 


aac 


exact 

yyy 


tac 


aac 


gtc 


504 


Glu Asn 


Cys 


Ara 


Phe 


Gin 


His 


Gin 


Thr 


Leu 


Glu Asn 


Glv 


Tvr* 


Asp 


Val 






90 










95 










100 










tac cac 


tct 


cct 


cag 


tat 


cac 


ttc 


ctg 


gtc 


agt 


ctg 


ggc 


egg 


gcg 


aag 


552 


Tyr His 


Ser 


Pro 


Gin 


Tvr 


His 


Phe 


Leu 


Val 


Ser 


Leu 


Glv 


Ara 


Ala 


Lys 




105 










110 










115 












aga gec 


ttc 


ctg 


cca 


ggc 


atg 


aac 


cca 


ccc 


ccg 


tac 


tec 


cag 


ttc 


ctg 


600 


Arg Ala 


Phe 


Leu 


Pro 


Glv 


Met 


Asn 


Pro 


Pro 


Pro Tyr 


Ser 


Gin 


Phe 


Leu 




120 








125 










130 










135 




tec egg 


agg 


aac 


gag 


ate 


ccc 


eta 


att 


cac 


ttc 


aac 


acc 


ccc 


ata 


cca 


648 


Ser Arg 


Ara 


Asn 


Glu 
140 


lie 


Pro 


Leu 


lie 


His 
145 


Phe 


Asn 


Thr 


Pro 


lie 
150 


Pro 




co a cog 


cac 


acc 


caa 
<-yy 


age 


gee 


y a y 


gac 


gac 


teg gag 


caa 
*-yy 


gac 


ccc 


ctg 


696 


Arg Arg 


His 


Thr 
155 


Ara 


Ser 


Ala 


Glu 


Asp 
160 


Asp 


Ser 


Glu 


Aira 


Asp 
165 


Pro 


Leu 




aac gtg 


ctcr 


aag 


CCC 


caa 
*-yy 


gec 


caa 
^yy 


ata 


acc 


ccg 


gec 


ccg 


gee 


tec 


tat 
L y L - 


744 


Asn Val 

"Oil V CL-L. 


Leu 
170 


T,VR 
jj jr o 


Pro 




Ala 


175 


Met 


Thr 


Pro 


Ala 


Pro 
180 


Ala 


Ser 






tea cag 


gag 


etc 


ccg 


age 


gee 


gag 


gac 


aac 


age 


ccg 


atg 


gec 


agt 


gac 


792 


Ser Gin 


m ii 

bill 


Leu 


Pro 


Ser 




m ii 


Asp 


Asn 


Ser 


Pro 


Mot- 
ive u 


Ala 


Ser 


Asp 




185 










190 










195 












cca tta 


ggg 


gtg 


gte 


agg 


ggc 


ggt 


cga 


gtg 


aac 


acg 


cac 


get 


ggg 


gga 


840 


Pro Leu 


Gly 


Val 


Val 


Arg 


Gly 


Gly 


Arg 


Val 


Asn 


Thr 


His 


Ala 


Gly 


Gly 




200 








205 










210 










215 




acg ggc 


ccg 


gaa 


ggc 


tgc 


cgc 


ccc 


ttc 


gec 


aag 


ttc 


ate 


tagggtcget 


889 


Thr Gly 


Pro 


Glu 


Gly 
220 


Cys 


Arg 


Pro 


Phe 


Ala 
225 


Lys 


Phe 


lie 











ggaagggcac cctctttaac ccatccctca geaaaegcag ctcttcccaa ggaccaggtc 949 
ecttgaegtt ccgaggatgg gaaaggtgac aggggcatgt atggaatttg ctgcttctct 1009 
ggggtccctt ccacaggagg tcctgtgaga accaaccttt gaggeccaag tcatggggtt 1069 
tcaccgcctt cctcactcca tatagaacac ctttcccaat aggaaacccc aacaggtaaa 1129 
ctagaaattt ccccttcatg aaggtagaga gaaggggtct ctcccaacat atttctcttc 1189 
cttgtgcctc tcctctttat cacttttaag catgaaaaaa aaaaaaaaaa 1239 



<210> 26 

<211> 251 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SIGNAL 
<222> 1. .24 



<400> 26 

Met Leu Gly Ala Arg Leu Arg Leu Trp Val Cys Ala Leu Cys Ser Val 

-20 " -15 -10 

Cys Ser Met Ser Val Leu Arg Ala Tyr Pro Asn Ala Ser Pro Leu Leu 
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-5 




1 






b 




Gly Ser 


Ser Trp 


Gly Gly Leu 


lie His 


Leu 


Tyr Thr 


Aia inr Ala 


Arg 


10 




lb 






o n 
z u 






Asn Ser 


Tyr His 


Leu Gin lie 


nl s i>y s 


Asn 


Gly His 


vai Asp Giy 


Ala 






J u 












Pro His 


Gin Tnr 


lie ryr faer 


Ala i>eu 


wee 


lie ATy 


O^^" 1 lion 

oer taiu Asp 


Aia 






4b 




b 0 




33 




Gly phe 


Val Val 


lie Tnr Gly 


vai Met 


Ser 


Arg Arg 


ryr lieu Lrys 


Men 




60 




©b 






/ 0 




Asp Phe 


Arg Gly 


Asn lie Phe 


Gly Ser 


TT -I ~ 

HIS 


Tyr Phe 


Asp Pro Glu 


Asn 




75 




tiO 






OS 




Cys Arg 


Phe Gin 


Hxs Gin Thr 


Leu Glu 


Asn 


Gly Tyr 


Asp Val Tyr 


His 


90 










inn. 
1U0 






Ser Pro 


Gin Tyr 


His Pne Leu 


val ser 


Leu 


Gly Arg 


Ala Lys Arg 


Til - 

Aia 


105 




"tin 
110 






llO 






Phe Leu 


Pro Gly 


Met Asn Pro 


Pro Pro 


Tyr 


Ser Gin 


Phe Leu Ser 


Arg 






125 




130 




1 o r 

135 




Arg Asn 


Glu lie 


Pro Leu lie 


His Phe 


Asn 


Thr Pro 


He Pro Arg 


Arg 




140 




145 






lbO 




His Thr 


Arg Ser 


Ala Glu Asp 


Asp Ser 


Glu 


Arg Asp 


Pro Leu Asn 


Val 




155 




160 






16b 




Leu Lys 


Pro Arg 


Ala Arg Met 


Thr Pro 


Ala 


Pro Ala 


Ser Cys Ser 


Gin 


170 




175 






180 






Glu Leu 


Pro Ser 


Ala Glu Asp 


Asn Ser 


Pro 


Met Ala 


Ser Asp Pro 


Leu 


185 




190 






195 




200 


Gly Val 


Val Arg 


Gly Gly Arg 


Val Asn 


Thr 


His Ala 


Gly Gly Thr 


Gly 






205 




210 




215 




Pro Glu 


Gly Cys 


Arg Pro Phe 


Ala Lys 


Phe 


He 








220 




225 











<210> 27 
<211> 1179 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> 5'UTR 
<222> 1. .115 

<220> 
<221> CDS 
<222> 116.. 961 

<220> 

<221> 3'UTR 
<222> 962. .1179 

<220> 

<221> polyA_signal 
<222> 1145. .1150 

<220> 

<221> polyA_site 
<222> 1164. .1179 

<400> 27 

acaaattccc aatgcagtta caggatcctg ggaagcagag tgtctggatg gaacctgagc 60 
tgggtctctg actcacttct gactttaggc gctcgaggac tgtgcccagg agcag atg 118 

Met 
1 

egg etc aga gec cag gtg cgc ctg ctt gag acc egg gtc aaa cag caa 166 
Arg Leu Arg Ala Gin Val Arg Leu Leu Glu Thr Arg Val Lys Gin Gin 
5 10 15 
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cag gtc 


aag 


ate 


aag 


cag 


ctt 


tta 
LLy 


cag 


y<*y 


aat 


gaa 


y t-L 


parr 
Lay 


L LL LLC 




Gin Val 




Tl e 

XXG 


Lys 


Gin 


Leu 


LI 


m n 

OJ.ll 


m n 
c»x u 


21 can 


Rill 

uX LL 


v dl 


m n 

VJXXl 


r*ne Lieu 






20 










25 










30 








gat aaa 


aaa 


gat 


aaa 


aat 


act 


gtc 


gtt 


gat 


Ctt 


aaa 
yy<* 


age 


aag 


A PTPT part 

ayy Lay 


OKI 


Asp Lys 


Glv 
oxy 


Asp 


Glu 


Asn 


Thr 


Val 


Val 


Asp 


Leu 


Glv 
oiy 


Ser 


xiy o 


niy v?xn 




35 










40 










45 










tat gca 


aat 
y 


tgt 


tea 


aaa 

y »y 


att 


ttc 


aat 


gat 


aaa 
yyy 


tat 


aaa 

a. ay 


etc 


apt* nna 

dy l yya 


"^1 ft 


Tyr Ala 


Asp 


Cys 


Ser 


Glu 


He 


Phe 


Asn 


Asp 


Glv 
oxy 


Tvt* 
xyx 


Lys 


Tien 


Car ^21 

ocx oxy 




50 








55 










60 








65 




ttt tac 


aaa 


ate 


aaa 


cct 


etc 


cag 


age 


cca 


gca 


gaa 


ttt 


tct 


y L L LdL 


j DO 


Phe Tyr 


Lys 


lie 


Lys 


Pro 


Tien 


Gin 


OCX 


XrXL* 


AT a 

Hid 


OX LI 


Phe 

XT 11C 


Ser 


vax lyr 










70 










75 










OA 

ou 




tgt gac 


atg 


tec 


aat 


aaa 


aaa 
yy a 


aaa 

yy° 


taa 

L yy 


act 


gta 


att 


Lay 


A PT A 

ay d 


cga tct 


Aft C 


Cvs ASTi 


Met 


Ser 


Asp 


Glv 


Gly 


Gly 


lip 


111X 


v di 


Tl e 
lie 


vsin. 


Arg 


Arg Ser 








85 










-7 U 
















aat aac 


agt 


gaa 


aac 


ttt 


aac 


aga 


ncrA 

yyc* 


tyy 


ddd 


gac 


"t - A t" 
LdL 


gaa 


aat yyc 




Artj a~\ v 

no Kt vjx y 


Ser 


Glu 


Asn 


Phe 




Arg 


uxy 


Trp 


Lys 


Asp 


Tyr 


oJLU 


Asn Gly 






100 










105 










11U 








ttt gga 


amt 


ttt 


gtc 


caa 


aaa 


pa +* 


yy t 


y dd 


LdL 


tgg 


ctg 


ggc 


aat aaa 




Phe Gly 


Xaa 


xriic 


V dl 


UXil 


T ,va 


nx a 


vjxy 


Lai LL 


Tyr 


Trp 


Leu 


pi,# 


Asn Lys 




115 










120 

X a V 










X^D 










aat ctt 


cac 


ttc 


tta 
u Ly 


ace 


act 


a ^ 
Ldd 


y ad 


y dL 


tac 


dLL 




aaa 


ate gac 


CCA 


2\ c? n Tien 


xi J. o 


It lie 


Leu 


-L LXJL 


■LUX. 




m n 


Asp 


Tyr 


inr 


Leu 


Lys 


lie Asp 




130 








135 










1 Aft 












ctt gca 






gaa 


aaa 


aat 


age 


cgt 


+- -a +- 


gca 


caa 


LdL 


aag 


aat ttc 


598 


T.eu Ala 


Asp 


Phe 


UJ. Li 


u y o 


A C!TI 
noil 


Q £3 

OCX 


Arg 


Tyr 


nld 




Tyr 


Lys 


Asn Jfne 










150 




















lDU 




aaa att 


aaa 
yy** 


aat 


gaa 


aag 


aat 


ttc 


tac 


y<*y 


ttg 


ddL 


dUU 


ggg 


gaa tat 




IiVS Val 


Gly 


Zi en 




uy a 


-riail 




Tyr 


m ii 


Leu 


Asn 


iie 


ijiy 


Glu Tyr 








165 










170 










1/3 






tct aaa 


aca 


apt 
y l l 


yy« 


aat 
y a l 


LLL 


LLL 


gcg 


ggg 


dd L 


ULL 


Cat 


cct 


gag gtg 


£T Q A 


Ser Gly 


Thr 


Ala 


Gly 




Car 
OCX. 


T .on 


Til a 
Hid 




Asn 


T)Tno 

irne 


nl B 


Pro 


r*l->i it's! 

vjxu vai 






180 










185 










X J7U 








Lay uyy 


taa 


get 


agt 


cac 


caa 


aga 


atg 


ddd 


L LL 


age 


acg 


tgg 


gac aga 


T/IO 

/ 4z 


Gin Trp 




Ala 


Ser 


His 


Gin 




Met- 
l'lc L 


Lys 


Jrlitr 


Ser 


Thr 


Trp 


Asp Arg 




195 










200 




















aat cat* 

MMW WML 


gac 


aac 


tat 


gaa 


yyy 


aac 


LyL 


gca 


gaa 


gaa 


gat 


cag 


tct ggc 




nofct HID 








L71 LI 




Asn 


Cys 


Ala 


LJlU 


Pin 
L7XU 


Asp 




oer oxy 




210 








215 










<s u 








O "D 




taa taa 
u yy u yy 


ttt 


aac 


aaa 
a yy 


tat 
L-y l. 


cac 


tyt 


pr/"* a 
y La 


aac 


ctg 


dd u 


ggt 


rr-t- A 
y La 


tac tac 


q *a o 


Trp Trp 


Phe 


Asn 


.fixy 


V— jr © 


nx o 


Aact 


Ala 


Asn 


*Leu 


Asn 


oxy 


vaj. 


lyr lyr 










230 










235 










240 




acre aac 


ccc 


tac 


acg 


act 


aaa 


aca 


gac 


aat 


ggg 


att 


gtc 


tgg 


tac ace 


O Q £T 


Ser Gly 


Pro 


Tyr 


Thr 


Ala 


Lys 


Thr 


Asp 


Asn 


Gly 


He 


Val 


Trp 


Tyr Thr 








245 










250 










255 






tgg cat 


ggg 


tgg 


tgg 


tat 


tct 


ctg 


aaa 


tct 


gtg 


gtt 


atg 


aaa 


att agg 


934 


Trp His 


Gly 


Trp 


Trp 


Tyr 


Ser 


Leu 


Lys 


Ser 


Val 


Val 


Met 


Lys 


He Arg 






260 










265 










270 








cca aat 


gat 


ttt 


att 


cca 


aat 


gta 


att 


taattgctgc tgttgggctt 


981 


Pro Asn 


Asp 


Phe 


He 


Pro 


Asn 


Val 


He 

















275 280 

tcgtttctgc aattcagctt tgtttaaagt gatttgaaaa atactcattc tgaacatatc 1041 

catgegcaat catgataact gttgtgagta gtgettttea ttcttctcac ttgcctttgt 1101 

tacttaatgt gctttcagta cagcagatat gcaatattca ccaaataaat gtagactgtg 1161 

tcaaaaaaaa aaaaaaaa 1179 



<210> 28 

<211> 282 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> UNSURE 
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<222> 116 

<223> Xaa = Asn, Thr 
<220> 

<221> UNSURE 
<222> 233 

<223> Xaa = Phe, Ser 



<400> 28 

Met Arg Leu Arg Ala Gin Val Arg Leu Leu Glu Thr Arg Val Lys Gin 

15 10 15 

Gin Gin Val Lys lie Lys Gin Leu Leu Gin Glu Asn Glu Val Gin Phe 

20 25 30 

Leu Asp Lys Gly Asp Glu Asn Thr Val Val Asp Leu Gly Ser Lys Arg 

35 40 45 

Gin Tyr Ala Asp Cys Ser Glu lie Phe Asn Asp Gly Tyr Lys Leu Ser 

50 55 60 

Gly Phe Tyr Lys lie Lys Pro Leu Gin Ser Pro Ala Glu Phe Ser Val 
65 70 75 80 

Tyr Cys Asp Met Ser Asp Gly Gly Gly Trp Thr Val lie Gin Arg Arg 

85 90 95 

Ser Asp Gly Ser Glu Asn Phe Asn Arg Gly Trp Lys Asp Tyr Glu Asn 

100 105 110 

Gly Phe Gly Xaa Phe Val Gin Lys His Gly Glu Tyr Trp Leu Gly Asn 

115 120 125 

Lys Asn Leu His Phe Leu Thr Thr Gin Glu Asp Tyr Thr Leu Lys He 

130 135 140 

Asp Leu Ala Asp Phe Glu Lys Asn Ser Arg Tyr Ala Gin Tyr Lys Asn 
145 150 155 160 

Phe Lys Val Gly Asp Glu Lys Asn Phe Tyr Glu Leu Asn He Gly Glu 

165 170 175 

Tyr Ser Gly Thr Ala Gly Asp Ser Leu Ala Gly Asn Phe His Pro Glu 

180 185 190 

Val Gin Trp Trp Ala Ser His Gin Arg Met Lys Phe Ser Thr Trp Asp 

195 200 205 

Arg Asp His Asp Asn Tyr Glu Gly Asn Cys Ala Glu Glu Asp Gin Ser 

210 215 220 

Gly Trp Trp Phe Asn Arg Cys His Xaa Ala Asn Leu Asn Gly Val Tyr 
225 230 235 240 

Tyr Ser Gly Pro Tyr Thr Ala Lys Thr Asp Asn Gly He Val Trp Tyr 

245 250 255 

Thr Trp His Gly Trp Trp Tyr Ser Leu Lys Ser Val Val Met Lys He 

260 265 270 

Arg Pro Asn Asp Phe He Pro Asn Val He 
275 280 



<210> 29 

<211> 1118 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> 5'UTR 
<222> 1. .344 



<220> 
<221> CDS 
<222> 345. .1118 



<220> 

<221> polyA_site 
<222> 1103 . .1118 
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<400> 29 

aatcctagtc ttcgtttggt ccggttgcac tcttcctata gcccagaggg cgagagggcc 60 
tgtggcctgg gggaaggagg acgaggttct gcctggatcc cagcaggacg ctgtgccatt 120 
tgggaacaaa ggaatagtct gcctggaatc cctgcagatc ttggggccgg aggccagtcc 180 
aacccttgga gcaggaagaa acgcaaagtt gtcaagaacc aagtcgagct gcctcagagc 240 
cggcccgcag tagctgcaga ctccgcccgc gacgtgtgcg cgcttctctg ggccagagcg 300 
agcctgtttt gtgctcgggt taagagattt gtcccagcta tacc atg ggc cgc act 356 

Met Gly Arg Thr 

egg gaa get ggc tgc gtg gec get ggt gtg gtt ate ggg get ggt gec 4 04 

Arg Glu Ala Gly Cye Val Ala Ala Gly Val Val He Gly Ala Gly Ala 

-15 -10 -5 1 

tgc tac tgt gta tac aga ctg get tgg gga aga gac gag aac gag aaa 452 

Cys Tyr Cys Val Tyr Arg Leu Ala Trp Gly Arg Asp Glu Asn Glu Lys 

5 10 15 

ate tgg gac gaa gac gag gag tct acg gac acc tea kag att ggg gtt 500 
He Trp Asp Glu Asp Glu Glu Ser Thr Asp Thr Ser Xaa He Gly Val 

20 25 30 

gag act gtg aaa gga get aaa act aac get ggg gca ggg tct ggg gec 548 
Glu Thr Val Lys Gly Ala Lys Thr Asn Ala Gly Ala Gly Ser Gly Ala 

35 40 45 

aaa ctt cag ggt gat tea gag gtc aag cct gag gtg agt ttg gga etc 596 
Lys Leu Gin Gly Asp Ser Glu Val Lys Pro Glu Val Ser Leu Gly Leu 
50 55 60 65 

gag gat tgt ccg ggt gta aaa gag aag gec cat tea gga tec cac age 644 
Glu Asp Cys Pro Gly Val Lys Glu Lys Ala His Ser Gly Ser His Ser 

70 75 80 

gga ggt ggc eta gag gec aag gee aag gec ctt ttc aac acg ctg aag 692 
Gly Gly Gly Leu Glu Ala Lys Ala Lys Ala Leu Phe Asn Thr Leu Lys 

85 90 95 

gaa cag gca agt gca aag gca ggc aaa ggg get agg gtg ggt acc ate • 740 
Glu Gin Ala Ser Ala Lys Ala Gly Lys Gly Ala Arg Val Gly Thr He 

100 105 110 

tct ggg aac agg acc ctt gca ccg agt tta ccc tgc cca gga ggc agg 788 
Ser Gly Asn Arg Thr Leu Ala Pro Ser Leu Pro Cys Pro Gly Gly Arg 

115 120 125 

ggt gga ggc tgc cac ccc acc agg agt gga tct agg gec ggg ggc agg 836 
Gly Gly Gly Cys His Pro Thr Arg Ser Gly Ser Arg Ala Gly Gly Arg 
130 135 140 145 

gca agt gga aaa tec aag gga aag gee cga agt aag age acc agg get 884 
Ala Ser Gly Lys Ser Lys Gly Lys Ala Arg Ser Lys Ser Thr Arg Ala 

150 155 160 

cca get aca aca tgg cct gtc egg aga ggc aag ttc aac ttt cct tat 932 
Pro Ala Thr Thr Trp Pro Val Arg Arg Gly Lys Phe Asn Phe Pro Tyr 

165 170 175 

aaa att gat gat att ctg agt get ccc gac etc caa aag gtc etc aac 980 
Lys He Asp Asp He Leu Ser Ala Pro Asp Leu Gin Lys Val Leu Asn 

180 185 190 

ate ctg gag cga aca aat gat cct ttt att caa gaa gta gec ttg gtc 102 8 
He Leu Glu Arg Thr Asn Asp Pro Phe He Gin Glu Val Ala Leu Val 

195 200 205 

act ctg ggt aac aat gca gca tat tea ttt aac cag aat gee ata cgt 1076 
Thr Leu Gly Asn Asn Ala Ala Tyr Ser Phe Asn Gin Asn Ala He Arg 
210 215 220 225 

gaa ttg ggt ggt gtc cca att att gca aaa aaa aaa aaa aaa 1118 
Glu Leu Gly Gly Val Pro He He Ala Lys Lys Lys Lys Lys 
230 235 

<210> 30 
<211> 258 
<212> PRT 

<213> Homo sapiens 
<220> 
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<221> SIGNAL 
<222> 1. .20 

<220> 

<z221> UNSURE 
<222> 49 

<223> Xaa = Glu, * 



<400> 30 



Met 


Gly 


Arc* 


Thr Arg Glu Ala Gly Cys 


-20 






-15 


Gly 


Ala 


Glv 


Ala Cys Tyr Cys Val Tyr 








1 5 


Glu 


Asn 


Glu 


Lys He Trp Asp Glu Asp 






15 


20 


Xaa 


He 


Glv 


Val Glu Thr Val Lvs Glv 

VCL-L. \JJLU XXXJ- VO.JL JJjr O VjJLjf 




30 




JO 


Glv 


Ser 


Glv 


Ala Lvs Leu Gin Glv Asn 


45 






50 


Ser 


Leu. 


Glv 


T.pn flT ti A p?ri r*vc3 Prn Rl v 

UCU wlU "tP^J .Y O C1.KJ vjxy 










Glv 


Ser 


His 


uCi V3Xjr VJXjf XJCU VJJ.U 








fin qc 


Asn 


Thr 


Leu 


Lvs Glu Gin AT a Sp»t* A3 a 






95 




Val 


Glv 


Thr 


He Ser Glv Asn Arcr Thr 




110 




115 


Pro 


Gly 


Gly 


Arg Gly Gly Gly Cys His 


125 






130 


Ala 


Gly 


Gly 


Arg Ala Ser Gly Lys Ser 








145 


Ser 


Thr 


Arg 


Ala Pro Ala Thr Thr Trp 








160 165 


Asn 


Phe 


Pro 


Tyr Lys He Asp Asp He 






175 


1B0 


Lys 


Val 


Leu 


Asn He Leu Glu Arg Thr 




190 




195 


Val 


Ala 


Leu 


Val Thr Leu Gly Asn Asn 


205 






210 


Asn 


Ala 


He 


Arg Glu Leu Gly Gly Val 








225 


Lys 


Lys 







<210> 31 

<211> 1273 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> 5'UTR 
<222> 1..13 

<220> 
<221> CDS 
<222> 14. .1048 

<220> 

<221> 3'UTR 
<222> 1049. .1273 

<220> 

<221> polyA_signal 
<222> 1234. .1239 
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Val 


Ala 


Ala Glv Val Val 


Tl f> 




-10 






Arg Leu 


Ala Tm Glv Arcr 


Asp 










Glu 


Glu 


Ser Thr Aeo Thr 


Ser 










Ala Lys 


Thr Asn Al a Ol v 

■L noil rtx cx \3JL jr 


Al a 






/in 




Ser 


Glu 


Val JJjS r*iO bill 


Vol 




55 




o U 


Val 


Lys 


liyS -H_Lcl JtlXS 


Ser 


70 








Ala Lys 


HXo JjyS .H.JL a. JjeU 


jfne 






on 




Lys 


Ala 




Arg 










Leu 


Ala 


Pro Ser Leu Pro 


Cys 






120 




Pro 


Thr 


Arg Ser Gly Ser 


Arg 




135 




140 


Lys 


Gly 


Lys Ala Arg Ser 


Lys 


150 




155 




Pro 


Val 


Arg Arg Gly Lys 


Phe 






170 




Leu 


Ser 


Ala Pro Asp Leu 


Gin 






185 




Asn Asp 


Pro Phe He Gin 


Glu 






200 




Ala 


Ala 


Tyr Ser Phe Asn 


Gin 




215 




220 


Pro 


He 


He Ala Lys Lys 


Lys 


230 




235 
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<220> 

<221> polyA_site 
<222> 1258. .1273 

<400> 31 

agaggttggg aag atg gcg tgg cga ggc tgg gcg cag aga ggc tgg ggc 49 
Met Ala Trp Arg Gly Trp Ala Gin Arg Gly Trp Gly 
-25 -20 -15 

tgc ggc cag gcg tgg ggt gcg teg gtg ggc ggc cgc age tgc gag gag 97 
Cys Gly Gin Ala Trp Gly Ala Ser Val Gly Gly Arg Ser Cys Glu Glu 

-10 -5 1 

etc act gcg gtc eta acc ccg ccg cag etc etc gga cgc agg ttt aac 145 
Leu Thr Ala Val Leu Thr Pro Pro Gin Leu Leu Gly Arg Arg Phe Asn 

5 10 15 

ttc ttt att caa caa aaa tgc gga ttc aga aaa gca ccc agg aag gtt 193 
Phe Phe lie Gin Gin Lys Cys Gly Phe Arg Lys Ala Pro Arg Lys Val 
20 25 30 35 

gaa cct cga aga tea gac cca ggg aca agt ggt gaa gca tac aag aga 241 
Glu Pro Arg Arg Ser Asp Pro Gly Thr Ser Gly Glu Ala Tyr Lys Arg 
40 45 50 
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gca 


ttt 


yy " 


tea 


get 


act 


att 


t*aa 


caa 


tat 
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He 
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Gin 
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cag 


agt 


tat 


ttt 


gat 


crcrt* 


ata 


aaa 


get. 




tgg 
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\j J. y 


Tl ^ 
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Trp 
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caa 
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Arcr 
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Gin 
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Glu 


Gly 


Asp 


Phe 






U1U 
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Asn 
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tgg 


tgg 
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cag 
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Trp 


Trp 
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Gly 
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aga 


gta 


cct 
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Val 
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cat 
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tta 
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Cys 
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Phe 
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195 


cac 
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gtt 
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tec 
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Tyr 
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Trp 


Ser 


Phe 


Ser 


Ser 


Ser 


He 
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210 




gtg 


aac 


att 


ctg 


ggt 


caa 
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ttc 


atg 


gca 


gtg 


tac 


eta 


tct 


gca 


Val 


Asn 


He 


Leu 


Gly 


Gin 


Glu 


Gin 


Phe 


Met 


Ala 


Val 


Tyr 


Leu 


Ser 


Ala 








215 










220 










225 






ggt 


gtt 


att 
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aat 
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agt 
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gtg 


ggt 
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gtt 
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Val 


He 


Ser 


Asn 


Phe 


Val 


Ser 


Tyr 


Val 


Gly 


Lys 


Val 


Ala 


Thr 


Gly 






230 










235 










240 
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tat 


gga 


cca 
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ctt 


ggt 
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ate 
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atg 
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Gly 
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Ser 
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Ala 


Ala 


Leu 


Lys 
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He 


He 


Ala 


Met 
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aca 
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atg 
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Trp 


Lys 
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Phe 
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His 


Ala 


Ala 


260 
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270 
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433 



481 



529 



577 



625 



673 



721 



769 



817 



865 



913 
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cat ctt ggg gga get ctt ttt gga ata tgg tat gtt act tac ggt cat 961 
His Leu Gly Gly Ala Leu Phe Gly lie Trp Tyr Val Thr Tyr Gly His 

280 285 290 

gaa ctg att tgg aag aac agg gag ccg eta gtg aaa ate tgg cat gaa 1009 
Glu Leu He Trp Lys Asn Arg Glu Pro Leu Val Lys lie Trp His Glu 

295 300 305 

ata agg act aat ggc ccc aaa aaa gga ggt ggc tct aag taaaactggg 1058 
He Arg Thr Asn Gly Pro Lys Lys Gly Gly Gly Ser Lys 
310 315 A 320 

attggacagt agtggtgcat ctggtccttg ccgcctgaga gccccaggag acateggcta 1118 
gagtgaccat ggctatgetc ccgtctggaa gatgecagea tctggcctcc cacttttttc 1178 
agctgtgtcc cccagtccgt gtctttttag aatgtgaatg atgataaagt tgtgaaataa 1238 
aggtttctat etagtttgea aaaaaaaaaa aaaaa 1273 

<210> 32 

<211> 345 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SIGNAL 
<222> 1. .26 



<400> 32 
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Trp 
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75 










80 85 


Phe 


Gly 


Ser 


Ala 


Ala 


He 


Trp 


Gin 
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Tyr 


Phe 
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Gly 


He 
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Asp Trp Leu Asp Ser He Arg 
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115 
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Gin 


Lys 


Glu 


Gly 


Asp 


Phe 


Arg 


Lys 


Glu He Asn Lys Trp Trp Asn 




120 










125 






130 


Asn 


Leu 
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Gly 


Gin 


Arg 


Thr 


Val 


Thr Gly He He Ala Ala Asn 


135 










140 








145 150 


Val 


Leu 


Val 


Phe 
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Leu 


Trp 


Arg 


Val 


Pro Ser Leu Gin Arg Thr Met 
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He 
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Tyr 


Phe 


Thr 


Ser 
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Pro 
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Ser Lys Val Leu Cys Ser Pro 








170 
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Met 
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Leu 


Ser 


Thr 


Phe 


Ser 
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Phe 
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185 










190 
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Asn 
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Ser 


Phe 
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Ser Ser He Val Asn He Leu 
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280 285 290 

Lys Asn Arg Glu Pro Leu Val Lys He Trp His Glu He Arg Thr Asn 
295 300 305 ~ 310 

Gly Pro Lys Lys Gly Gly Gly Ser Lys 
315 

<210> 33 

<211> 723 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> 5»UTR 
<222> 1. .72 

<220> 
<221> CDS 
<222> 73. .672 

<220> 

<221> 3»UTR 
<222> 673. .723 

<220> 

<221> polyA_signal 
<222> 689.. 694 

<220> 

<221> polyA_site 
<222> 708. .723 

<400> 33 

acaagaaaag aacatggtct agactgaagt accaactaaa tcatctcctt tcaaattatc 60 
accgacacca tc atg gat tea age acc gca cac agt ccg gtg ttt ctg gta 111 
Met Asp Ser Ser Thr Ala His Ser Pro Val Phe Leu Val 
15 10 



ttt 


cct 


cca 


gaa 


ate 


act 


get 


tea 


gaa 


tat 


gag 
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gaa 


ctt 


tea 
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caa 
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tta 
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207 
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Phe 
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He 
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He 
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He 
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Thr 
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303 
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ata 
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cga 


ata 
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Thr 
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Thr 


Leu 


He 


He 
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He 
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Phe 


Leu 
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Ser 


Gin 


Cys 
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145 150 155 
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get gtt act gtc ctg ttc ttg gga att 
Ala Val Thr Val Leu Phe Leu Gly lie 

160 165 
age att att gaa tta ttc att tct ctg 
Ser lie He Glu Leu Phe He Ser Leu 

175 180 
cac tea gag gat tgt gat tgt gaa caa 
His Ser Glu Asp Cys Asp Cys Glu Gin 
190 195 
aagatgtgtt aaaataaaaa aaaaaaaaaa t 



ttg att aca ttg atg act ttc 591 
Leu He Thr Leu Met Thr Phe 
170 

cct ttc tea att ttg ggg tgc 639 
Pro Phe Ser He Leu Gly Cys 
185 

tgt tgt tgactagcac tgtgagaata 692 
Cys Cys 
200 

723 



<210> 34 

<211> 200 

<212> PRT 

<213> Homo sapiens 



<400> 34 
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Ser 


Thr Ala 
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Ser 


Pro Val 


Phe Leu Val Phe Pro Pro 
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10 


15 


Glu 


He 
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Ser Glu 


Tyr Glu 


Ser Thr 


Glu Leu Ser Ala Thr Thr 
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30 


Phe 
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Phe Ala Arg Lys Met Lys 
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He 
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Gly 


Thr 


He Gin 


He 
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Phe Gly 


He Met Thr Phe Ser Phe 
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60 
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He 
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Leu Lys 


Pro Tyr Pro Arg Phe Pro 


65 
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He 


Phe 
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Phe Trp 


Gly Ser Val Leu Phe He 
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Gly 


Ala 


Phe Leu 


He 


Ala 


Val Lys 


Arg Lys Thr Thr Glu Thr 
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He 
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Leu 


Ser Arg 


He 


Met 


Asn Phe 


Leu Ser Ala Leu Gly Ala 






115 








120 




125 


He 


Ala 


Gly 


He 


He Leu 


Leu 


Thr 


Phe Gly 


Phe He Leu Asp Gin Asn 




130 








135 






140 


Tyr 


He 


Cys 
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Tyr Ser 
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Gin 


Asn Ser 


Gin Cys Lys Ala Val Thr 


145 








150 








155 160 


Val 
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Phe 
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Gly He 
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He 
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Met Thr Phe Ser He He 
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Glu 
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<210> 35 

<211> 845 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> 5'UTR 
<222> 1. .118 



<220> 
<221> CDS 
<222> 119. .655 



<220> 

<221> 3'UTR 
<222> 656. .845 



<220> 

<221> polyA_signal 
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<222> 809. .814 
<220> 

<221> polyA_site 
<222> 830. .845 

<400> 35 

acaaatagcc ccggatatct gtgttaccag ccttgtctcg gccacctcaa ggataatcac 60 
taaattctgc caaaaggact gaggaacggt gcctggaaaa gggcaagaat atcacggc 118 

ctg aag tat gtc ctg ttt ttc ttc 166 
Leu Lys Tyr Val 
10 

tgc tgc att ttg ggc ttt ggg ate 214 
Cys Cys lie Leu 
25 

gga gtg etc ttc cat aac etc ccc 262 
Gly Val Leu Phe 

gtc- ate gtg ggc tct att ate atg 310 
Val He Val Gly 
60 

ggc tct ate aag gaa aac aag tgt 358 
Gly Ser He Lys 
75 

ctg ctg att ate etc ctt get gag 406 
Leu Leu He 3 
90 

gtg get aag ggt ctg ace gac age 454 
Val Ala Lys Gly 
105 

age acc aag gca gcg tgg gac tec 502 
Ser Thr Lys ; 

ggt ata aat ggc acg agt gat tgg 550 
Gly He Asn Gly 
140 

ccc tea gat cga aaa gtg gag ggt 598 
Pro Ser Asp 1 
155 

ttt cat tec aat ttc ttt att aga 646 
Phe His Ser Asn 
165 170 
ggg cct tat tgatgtgttc taagtctttc cagaaaaaaa ctatccagtg 695 
Gly Pro Tyr 

atttatatcc tgatttcaac cagtcactta gctgataatc acagtaagaa gacttctggt 755 
attatctctc tatcagataa gattttgtta atgtactatt ttactcttca ataaataaaa 815 
cagtttatta tegcaaaaaa aaaaaaaaaa 845 

<210> 36 
<211> 179 
<212> PRT 

<213> Homo sapiens 
<400> 36 

Met Gly Met Ser Ser Leu Lys Leu Leu Lys Tyr Val Leu Phe Phe Phe 

1 5 10 15 

Asn Leu Leu Phe Trp He Cys Gly Cys Cys He Leu Gly Phe Gly He 

20 25 30 

Tyr Leu Leu He His Asn Asn Phe Gly Val Leu Phe His Asn Leu Pro 

35 40 45 

Ser Leu Thr Leu Gly Asn Val Phe Val He Val Gly Ser He He Met 

50 55 60 

Val Val Ala Phe Leu Gly Cys Met Gly Ser He Lys Glu Asn Lys Cys 
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etc 


ccc 


TT -J 0 

rlXS 


Asn 


Lieu 


Pro 


A c 
*± D 








tct 


att 


ate 


atg 


Ser 


He 


He 


Met 


gaa 


aac 


aag 


i- —,4- 

tgt 


GlU 


Asn 


Lys 


Cys 








Q r\ 


etc 


ctt 


get 


gag 


Leu 


Leu 


Ala 


Glu 






95 




ctg 


acc 


gac 


age 


Leu 


Thr 


Asp 


Ser 




110 






gcg 


tgg 


gac 


tec 


Ala 


Trp 


Asp 


Ser 


125 








acg 


agt 


gat 


tgg 


Thr 


Ser 


Asp 


Trp 


aaa 


gtg 


gag 


ggt 


Lys 


Val 


Glu 


Gly 








160 


ttc 


ttt 


att 


aga 


Phe 


Phe 


He 


Arg 






175 
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65 






70 


/ D 




q r\ 
oO 


Leu 


UG IX 


Mct-h Cat- T3"h<=» 


rue lie 


Ucu jjcu Jjcu lie lie lieu 


Leu Ala 


GiU 






85 










Val 


Thr 


TiPii ZVT a Tl A 

11CU J-i. JL d J.JL~ 


ucu 1ICU 


rrixc Val Ala liy o Vjiy lieu 


xur Asp 


Ser 






100 




105 


X JL U 




He 




ru.y iyi nio 


Cot* San 


HSU OCl lili JjyB Ala Ala 


Trp Asp 


Ser 






115 




120 125 






He 


Gin 


Ser Phe Leu 


Gin Cys 


Cys Gly He Asn Gly Thr 


Ser Asp 


Trp 




130 




135 


140 






Thr 


Ser 


Gly Pro Pro 


Ala Ser 


Cys Pro Ser Asp Arg Lys 


Val Glu 


Gly 


145 






150 


155 




160 


Cys 


Tyr 


Ala Lys Ala 


Arg Leu 


Trp Phe His Ser Asn Phe 


Phe He 


Arg 






165 




170 


175 




Gly 


Pro 


Tyr 











<210> 37 
<211> 517 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> 5'UTR 
<222> 1. .16 

<220> 
<221> CDS 
<222> 17. .259 

<220> 

<221> 3'UTR 
<222> 260. .517 

<400> 37 

ttccatagaa tgggag atg tea cca ggg cag cct atg aca ttc ccc cca gag 52 

Met Ser Pro Gly Gin Pro Met Thr Phe Pro Pro Glu 
15 io 



gec 


ctg 


tgg 


gtg 


ace 


gtg 


ggg 


ctg 


tct 


gtc 


tgt 


etc 


att gca 


ctg 


ctg 


100 


Ala 


Leu 


Trp 


Val 


Thr 


Val 


Gly 


Leu 


Ser 


Val 


Cys 


Leu 


He Ala 


Leu 


Leu 








15 










20 










25 








gtg 


gec 


ctg 


get 


ttc 


gtg 


tgc 


tgg 


aga 


aag 


ate 


aaa 


cag age 


tgt 


gag 


148 


Val 


Ala 


Leu 


Ala 


Phe 


Val 


Cys 


Trp 


Arg 


Lys 


He 


Lys 


Gin Ser 


Cys 


Glu 






30 










35 










40 










gag 


gag 


aat 


gca 


gga 


get 


gag 


gac 


cag 


gat 


ggg 


gag 


gga gaa 


ggc 


tec 


196 


Glu 


Glu 


Asn 


Ala 


Gly 


Ala 


Glu 


Asp 


Gin 


Asp 


Gly 


Glu 


Gly Glu 


Gly 


Ser 




45 










50 










55 








60 




aag 


aca 


gec 


ctg 


cag 


cct 


ctg 


aaa 


cac 


tct 


gac 


age 


aaa gaa 


gat 


gat 


244 


Lys 


Thr 


Ala 


Leu 


Gin 


Pro 


Leu 


Lys 


His 


Ser 


Asp 


Ser 


Lys Glu 


Asp 


Asp 





65 70 75 



gga caa gaa ata gee tgaccatgag gaccagggag ctgctacccc tccctacagc 299 
Gly Gin Glu lie Ala 
80 

tcctaccctc tggctgcaat ggggctgeae tgtgagccct gcccccaaca gatgeatect 359 
gctctgacag gtgggctcct tctccaaagg atgegataca cagaccactg tgeagectta 419 
tttctccaat ggacatgatt cccaagtcat cctgctgcct tttttcttat agacacaatg 479 
aacagaccac ccacaacctt agttctctaa gtcatcct 517 

<210> 38 
<211> 81 
<212> PRT 

<213> Homo sapiens 
<400> 38 

Met Ser Pro Gly Gin Pro Met Thr Phe Pro Pro Glu Ala Leu Trp Val 
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15 10 15 

Thr Val Gly Leu Ser Val Cys Leu lie Ala Leu Leu Val Ala Leu Ala 

20 25 30 

Phe Val Cys Trp Arg Lys lie Lys Gin Ser Cys Glu Glu Glu Asn Ala 

35 40 45 

Gly Ala Glu Asp Gin Asp Gly Glu Gly Glu Gly Ser Lys Thr Ala Leu 

50 55 60 

Gin Pro Leu Lys His Ser Asp Ser Lys Glu Asp Asp Gly Gin Glu lie 
65 70 75 80 

Ala 

<210> 39 

<211> 1816 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> 5'UTR 
<222> 1. .259 

<220> 
<221> CDS 
<222> 260. .1048 

<220> 

<221> 3'UTR 
<222> 1049. .1816 

<220> 

<221> polyA_signal 
<222> 1782. .1787 

<220> 

<221> polyA_site 
<222> 1801.. 1816 

<400> 39 

actctggggc cattgccagc cggctgtagg cattcagggc agtgtcttct gcatctccta 60 
ggaacctcgg gagcggcagc tccggcgcct ggtagcgaga ggcgggttcc ggagatcccg 120 
gcctcacttc gtcccactgt ggttaggggt gagtcctgcg aatgttaagt gatttgctca 180 
aggtgcccat ttcgcaggaa ttggagccca ggccagttct ctgagcctat cattagggct 240 
aaaggagtgc gtgatcaga atg gtg tct gga egg ttc tac ttg tec tgc ctg 292 

Met Val Ser Gly Arg Phe Tyr Leu Ser Cys Leu 
-15 -10 



ctg 


ctg 


ggg 


tec 


ctg 


ggc 


tct 


atg 


tgc 


ate 


etc 


ttc 


act 


ate 


tac 


tgg 


340 


Leu 


Leu 


Gly 


Ser 
-5 


Leu 


Gly 


Ser 


Met 


Cys 
1 


He 


Leu 


Phe 


Thr 
5 


lie 


Tyr 


Trp 




atg 


cag 


tac 


tgg 


cgt 


ggt 


ggc 


ttt 


gee 


tgg 


aat 


ggc 


age 


ate 


tac 


atg 


388 


Met 


Gin 


Tyr 


Trp 


Arg 


Gly 


Gly 


Phe 


Ala 


Trp 


Asn 


Gly 


Ser 


lie 


Tyr 


Met 






10 










15 










20 












ttc 


aac 


tgg 


cac 


cca 


gtg 


ctt 


atg 


gtt 


get 


ggc 


atg 


gtg 


gta 


ttc 


tat 


436 


Phe 


Asn 


Trp 


His 


Pro 


Val 


Leu 


Met 


Val 


Ala 


Gly 


Met 


Val 


Val 


Phe 


Tyr 




25 










30 










35 










40 




gga 


ggt 


gcg 


tea 


ctg 


gtg 


tac 


cgc 


ctg 


ccc 


cag 


teg 


tgg 


gtg 


ggg 


ccc 


484 


Gly 


Gly 


Ala 


Ser 


Leu 


Val 


Tyr 


Arg 


Leu 


Pro 


Gin 


Ser 


Trp 


Val 


Gly 


Pro 












45 










50 










55 






aaa 


ctg 


ccc 


tgg 


aaa 


etc 


etc 


cat 


gca 


gcg 


ctg 


cac 


ctg 


atg 


gee 


ttc 


532 


Lys 


Leu 


Pro 


Trp 


Lys 


Leu 


Leu 


His 


Ala 


Ala 


Leu 


His 


Leu 


Met 


Ala 


Phe 










60 










65 










70 








gtc 


etc 


act 


gtt 


gtg 


ggg 


ctg 


gtt 


get 


gtc 


ttt 


a eg 


ttt 


cac 


aac 


cat 


580 


Val 


Leu 


Thr 


Val 


val 


Gly 


Leu 


Val 


Ala 


Val 


Phe 


Thr 


Phe 


His 


Asn 


His 








75 










80 










85 










gga 


agg 


act 


gec 


aac 


etc 


tac 


tec 


ctt 


cac 


age 


tgg 


ctg 


ggc 


ate 


ace 


628 
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Gly Arg 


Thr Ala Asn Leu Tyr 


Ser 


Leu His 


Ser Trp Leu Gly 


lie 


Thr 






90 


95 






10U 








act 


gtc 


ttc etc ttc ggc tgc 


cag 


tgg ttc 


ctg ggc ttc get 


gtc 


ttc 


b /© 


Thr 


Val 


Phe Leu Phe Gly Cys 


Gin 


Trp Phe 


Leu Giy fRQ A±a 


vai 


rne 




105 




110 






lib 








etc 


ctg 


ccc tgg gcg tec atg 


tgg 


ctg cgc 


age ccc una aaa 


cct 


ate 


1 A*± 


Leu 


Leu 


Pro Trp Ala Ser Met 


Trp 


Leu Arg 


ser jjeu iieu i»ys 


Pro 


lie 








125 














cac 


gtc 


ttt ttt gga gee gec 


ate 


etc tct 


ctg tec ate gea 


tec 


gtc 


1 /Z 


His 


Val 


Phe Phe Gly Ala Ala 


lie 


Leu Ser 


Leu Ser lie Ala 


Ser 


vai 








140 




14 b 


i c n 
lbU 








att 


teg 


ggc att aat gag aag 


ctt 


ttc ttc 


agt ttg aaa aac 


acc 


acc 




lie 


Ser 


Gly lie Asn Glu Lys 


Leu 


Phe Phe 


Ser Leu Lys Asn 


Tnr 


Thr 








155 


160 




lob 








agg 


cca 


tac cac age ctg ccc 


agt 


gag gcg 


gtc ttt gec aac 


age 


acc 


aba 


Arg Pro 


Tyr His Ser Leu Pro 


Ser 


Glu Ala 


Val Phe Ala Asn 


Ser 


Thr 






170 


175 






180 








ggg 


atg 


ctg gtg gtg gec ttt 


ggg 


ctg ctg 


gtg etc tac ate 


ctt 


ctg 


916 


Gly Met 


Leu Val Val Ala Phe 


Gly 


Leu Leu 


Val Leu Tyr He 


Leu 


Leu 




185 




190 






195 




200 




get 


tea 


tct tgg aag cgc cca 


gag 


ccg ggg 


ate ctg acc gac 


aga 


cag 


964 


Ala 


Ser 


Ser Trp Lys Arg Pro 


Glu 


Pro Gly 


He Leu Thr Asp 


Arg 


Gin 








205 




210 




215 






ctg ctg 


eta cag ctg agg cct 


gga 


tec egg 


cct ttc cct gtg 


act 


tac 


1012 


Leu 


Leu 


Leu Gin Leu Arg Pro 


Gly 


Ser Arg 


Pro Phe Pro Val 


Thr Tyr 








220 




225 


230 








gtg 


tct 


gtc acc ggc agg cag 


ccc 


tac aaa 


tec tgg tgacctgctc 




1058 


Val 


Ser 


Val Thr Gly Arg Gin 


Pro 


Tyr Lys 


Ser Trp 












235 


240 













tcccaagaac agagcctgtc cccagatgtc ecagtagega tgagtaacag aggtggctgt 1118 
ggacttcctc tacttctcct tgctggatca gggccttcct gcctcccgct gggcaggtct 1178 
ggccttgctc tcttggcagg gccccagccc ctctgaccac tctgcagctc accatgcagc 1238 
tgatgccaaa gttgtggtgt ccagtgtgca gcagccctgg gagccactgc caccttcaga 1298 
ggggttcctt gctgagaccc acattgette acctggcccc accatggctg cttgcctggc 1358 
ccaacctagc gttctgtgcc atgetagaac ttgagctgtt gctcttcttc aggggaggaa 1418 
atagggtgga gagegggaag ggtcttgetc ctaagtgttg ctgctgtggc ttttttgect 1478 
tctccaaaga cgcactgcca ggtcccaagc ttcagactgc tgtgcttagt aagcaagtga 153 8 
gaagcctggg gtttggagcc cacctactct ctggcagcat cagcatccta ctcctggcaa 1598 
catcaggcca acgtccaccc cagcctcaca ttgccagatg ttggcagaag ggctaatatt 1658 
gaeegtcttg actggctgga gecttcaaag ccactgggat gtcctccagg cacctgggtc 1718 
ccatgaccag ctccccgtct ccataggggt aggcatttca ctggtttatg aagctcgagt 1778 
ttcattaaat atgttaagaa tcaaaaaaaa aaaaaaaa 1816 



<210> 40 
<211> 263 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 
<222> 1. .20 



<400> 40 

Met Val Ser Gly Arg Phe Tyr Leu Ser Cys Leu Leu Leu Gly Ser Leu 
-20 -15 -10 -5 

Gly Ser Met Cys He Leu Phe Thr He Tyr Trp Met Gin Tyr Trp Arg 

1 5 10 

Gly Gly Phe Ala Trp Asn Gly Ser He Tyr Met Phe Asn Trp His Pro 

15 20 25 

Val Leu Met Val Ala Gly Met Val Val Phe Tyr Gly Gly Ala Ser Leu 

30 35 40 

Val Tyr Arg Leu Pro Gin Ser Trp Val Gly Pro Lys Leu Pro Trp Lys 
45 50 55 60 
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Leu 


Leu 


His 


Ala 


Ala 


Leu 


His 


Leu 


Met 


Ala 


Phe Val 


Leu 


Thr 


Val 


Val 










65 










70 








75 




Gly 


Leu 


Val 


Ala 


Val 


Phe 


Thr 


Phe 


His 


Asn 


His Gly 


Arg 


Thr 


Ala 


Asn 








80 










85 








90 






T ~ - - 

Leu 


Tyr 


Ser 


Leu 


His 


Ser 


Trp 


Leu 


Gly 


He 


Thr Thr 


Val 


Pne 


Leu 


Phe 






95 










100 








105 








Gly 


Cys 


Gin 


Trp 


Phe 


Leu 


Gly 


Phe 


Ala 


Val 


Phe Leu 


Leu 


Pro 


Trp 


Ala 




110 










115 








120 










Ser 


Met 


Trp 


Leu 


Arg 


Ser 


Leu 


Leu 


Lys 


Pro 


lie His 


Val 


Phe 


Phe 


Gly 


125 










130 










135 








140 


Ala 


Ala 


lie 


Leu 


Ser 


Leu 


Ser 


He 


Ala 


Ser 


Val lie 


Ser 


Gly 


He 


Asn 










145 










150 








155 




Glu 


Lys 


Leu 


Phe 


Phe 


Ser 


Leu 


Lys 


Asn 


Thr 


Thr Arg 


Pro 


Tyr 


His 


Ser 








160 










165 








170 






Leu 


Pro 


Ser 


Glu 


Ala 


Val 


Phe 


Ala 


Asn 


Ser 


Thr Gly 


Met 


Leu 


Val 


Val 






175 










180 








185 








Ala 


Phe 


Gly 


Leu 


Leu 


Val 


Leu 


Tyr 


He 


Leu 


Leu Ala 


Ser 


Ser 


Trp 


Lys 




190 










195 








200 










Arg 


Pro 


Glu 


Pro 


Gly 


He 


Leu 


Thr 


Asp 


Arg 


Gin Leu 


Leu 


Leu 


Gin 


Leu 


205 










210 










215 








220 


Arg 


Pro 


Gly 


Ser 


Arg 


Pro 


Phe 


Pro 


Val 


Thr 


Tyr Val 


Ser 


Val 


Thr 


Gly 










225 










230 








235 




Arg 


Gin 


Pro 


Tyr 


Lys 


Ser 


Trp 



















240 

<210> 41 
<211> 643 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> 5'UTR 
<222> 1..90 

<220> 
<221> CDS 
<222> 91. .462 

<220>. 

<221> 3»UTR 
<222> 463. .643 

<220> 

<221> polyA_signal 
<222> 607. ,612 

<220> 

<221> polyA_site 
<222> 628. .643 

<400> 41 

acccctaccc cacgccccct cccgcgcgcg cggttaaatc cccgcacctg agcatcggct 60 
cacacctgca ccccgcccgg gcatagcacc atg cct get tgt cgc eta ggc ccg 114 

Met Pro Ala Cys Arg Leu Gly Pro 
-25 

eta gee gee gee etc etc etc age ctg ctg ctg ttc ggc ttc ace eta 162 
Leu Ala Ala Ala Leu Leu Leu Ser Leu Leu Leu Phe Gly Phe Thr Leu 

-20 -15 -10 

gtc tea ggc aca gga gca gag aag act ggc gtg tgc ccc gag etc cag 210 
Val Ser Gly Thr Gly Ala Glu Lys Thr Gly Val Cys Pro Glu Leu Gin 
-5 15 10 

get gac cag aac tgc acg caa gag tgc gtc teg gac age gaa tgc gee 258 
Ala Asp Gin Asn Cys Thr Gin Glu Cys Val Ser Asp Ser Glu Cys Ala 
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lb 


£. U 




zb 








gac 


aac 


CCC 


aag 


tgc tgc age gcg ggc tgt gee acc 


cue 


cgc 


cct 


ccg 


306 


Asp 


Asn 


ijeu 

J u 


Lys 


Cys Cys Ser Ala Gly Cys Ala Thr 


i?ne 

Aft 


Cys 


Ser 


Leu 




ccc 


aat 


gat 


aag 


g a g ggt tec tgc ccc cag gtg aac 


att 


aac 


ttt 


ccc 


354 


Piro 


Asn 


Asp 


Lys 


Glu Gly Ser Cys Pro Gin Val Asn 
50 55 


lie 


Asn 


Pne 


Pro 




cag 


etc 


ggc 


etc 


tgt egg gac cag tgc cag gtg gac 


age 


cag 


tgt 


cct 


402 


Gin 


Leu 


Gly 


Leu 


Cys Arg Asp Gin Cys Gin Val Asp 


Ser 


Gin 


Cys 


Pro 




60 








65 70 








75 




ggc 


cag 


atg 


aaa 


tgc tgc cgc aat ggc tgt ggg aag 


gtg 


tec 


tgt 


gtc 


450 


Gly 


Gin 


Met 


Lys 


Cys Cys Arg Asn Gly Cys Gly Lys 
80 .85 


Val 


Ser 


Cys 
90 


Val 




act 


ccc 


aat 


ttc 


tgagctccag ccaccaccag gctgagcagt gaggagagaa 




502 


Thr 


Pro 


Asn 


Phe 
95 















agtttctgcc tggccctgca tctggttcca gcccacctgc cctccccttt ttegggaetc 562 
tgtattccct cttgggctga ccacagcttc tccctttccc aaccaataaa gtaaccactt 622 
tcagcaaaaa aaaaaaaaaa a 643 



<210> .42 
<211> 124 
<212> PRT 

<213> Homo sapiens " 
<220> 

<221> SIGNAL 
<222> 1. .30 



<400> 42 



Met 


Pro 


Ala Cys Arg Leu 


Gly Pro 


Leu 


Ala 


Ala Ala 


Leu Leu 


Leu 


Ser 


-30 




-25 










-20 






-15 


Leu 


Leu 


Leu Phe Gly Phe 


Thr 


Leu 


Val 


Ser 


Gly Thr 


Gly Ala 


Glu 


Lys 






-10 








-5 






1 




Thr 


Gly 


Val Cys Pro Glu 


Leu 


Gin 


Ala 


Asp 


Gin Asn 


Cys Thr 


Gin 


Glu 






5 




10 








15 






Cys 


Val 


Ser Asp Ser Glu 


Cys 


Ala 


Asp 


Asn 


Leu Lys 


Cys Cys 


Ser 


Ala 




20 




25 








30 








Gly 


Cys 


Ala Thr Phe Cys 


Ser 


Leu 


Pro 


Asn 


Asp Lys 


Glu Gly 


Ser 


Cys 


35 




40 










45 






50 


Pro 


Gin 


Val Asn He Asn 


Phe 


Pro 


Gin 


Leu 


Gly Leu 


Cys Arg 


Asp 


Gin 






55 








60 






65 




Cys 


Gin 


Val Asp ser Gin 


Cys 


Pro 


Gly 


Gin 


Met Lys 


Cys. Cys 


Arg 


Asn 






70 






75 






80 






Gly 


Cys 


Gly Lys Val Ser 


Cys 


Val 


Thr 


Pro 


Asn Phe 









85 90 



<210> 43 

<211> 501 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> 5'UTR 
<222> 1. .227 

<220> 
<221> CDS 
<222> 228. .501 

<400> 43 

actcttactc tttctctctc actctctctc ttttcccacc ettaagecaa gtacagggat 60 
agttgtctca tcattggtgg cttaaaatga tgtttttgaa caagaagaca ccccatggga 120 
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ctgatctcaa atgcagctgt gactaaaacc tctaggtgct gtgctgtcct gaggcctggg 180 
ccatggtgcc caaggaaagc ccctgaagct caccaggagg aagaagc atg cag ggc 236 

Met Gin Gly 
-30 



apt- 


cc t 


gga 


ggc 


ggg 


au^ cgc Ctt yyy LUa itt tuo 


yty gac «-yy egg 




Thr 


"Dm 






Gly 
-25 


TTit* A r*cr Prn OTv Pro Spy* pro 
-20 


Va "1 A en S tci 21 yet 

-15 




aca 


etc 


ctg 


gtc 




age uuc auc cuy yea yea ycu 


4— +— r~f rti^rf* «aa a^/r 

t-cy ggc caa acg 






Lsu 


Leu 


val 

-10 


flit: 


oci file lie lacU. Ala Ala nla 

-5 


iJcu oiy vaXIl i v ie L. 
1 




aat 


ttc 


aca 


ggg 


gac 


cag gtt ctt cga gtc ctg gec 


aaa gat gag aag 


380 


Asn 


Phe 
5 


Thr 


Gly 


Asp 


Gin Val Leu Arg Val Leu Ala 
10 15 


Lys Asp Glu Lys 




cag 


ctt 


tea 


ctt 


etc 


ggg gat ctg gag ggc ctg aaa 


ccc cag aag gtg 


428 


Gin 


Leu 


Ser 


Leu 


Leu 


Gly Asp Leu Glu Gly Leu Lys 


Pro Gin Lys Val 




20 










25 30 


35 




gac 


ttc 


tgg 


cgt 


ggc 


cca gee agg ccc age etc cct 


gtg gat atg aga 


476 


Asp 


Phe 


Trp 


Arg 


Gly 
40 


Pro Ala Arg Pro Ser Leu Pro 
45 


Val Asp Met Arg 
50 




gtt 


cct 


ttc 


tec 


gaa 


ctg aaa gac a 




501 


Val 


Pro 


Phe 


Ser 
55 


Glu 


Leu Lys Asp 







<210> 44 








<211> 91 








<212> PRT 








<213> Homo sapiens 








<220> 








<221> SIGNAL 








<222> 1. .33 








<400> 44 








Met Gin Gly Thr Pro 


Gly Gly Gly Thr Arg Pro Gly 


Pro Ser Pro 


Val 


-30 


-25 


-20 




Asp Arg Arg Thr Leu 


Leu Val Phe Ser Phe lie Leu 


Ala Ala Ala 


Leu 


-15 


-10 


-5 




Gly Gin Met Asn Phe 


Thr Gly Asp Gin Val Leu Arg 


Val Leu Ala Lys 


1 


5 10 




15 


Asp Glu Lys Gin Leu 


Ser Leu Leu Gly Asp Leu Glu 


Gly Leu Lys 


Pro 


20 


25 


30 




Gin Lys Val Asp Phe 


Trp Arg Gly Pro Ala Arg Pro 


Ser Leu Pro 


Val 


35 


40 


45 




Asp Met Arg Val Pro 


Phe Ser Glu Leu Lys Asp 






50 


55 







<210> 45 

<211> 960 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> 5'UTR 
<222> 1. .97 

<220> 
<221> CDS 
<222> 98. .934 

<220> 
<221> 

<222> 935.. 960 



38 



WO 02/094864 



PCT/IB01/01715 



<400> 45 

ataatcacct ctcattccag actatgttag gtcttaatgg tgggaggacg cccgagtgct 60 
cggcccgttt caccccgagg aggaaggaca ctgggtc atg acg cca tea gaa ggc 115 

Met Thr Pro Ser Glu Gly 
1 5 



gec aga 


gca 


aaa 

yyy 


acc 


yy<* 


cgc 


y d y 


tta 


era a a a 


Liy yaw 


t*ccf efcer tt~CT 

uv-y ^ L-y u- uy 


Xw«? 


Ala Arg 


Ala 


Gl v 
vsiy 


Thr* 




m y 


V3 JL 


ucu 


V3 X U. 1*1C L. 




Opv- Tj^ll Ti»ll 

O^-L UCU. JJCU 








J. V 










15 






20 




ocr tta 


aac 
yyi 


aac 
yyc 


ctg 


y L-y 


ctg 


ctt 


caa 
oyy 


rtat" ^rr* 
yaL. uol, 


yiy yay 


t*crcf era cr aaa 

L yy y<*y yyy 


211 


Ala TtRii 


Gly 


ui y 


XlC u. 


Val 


Leu 


JJCU 


Arrr 

Airg 


ri.&JJ OCX 


Val filii 
vax ox Li 


Tm f3ln 

up vjiu uiy 






£• O 






















^yt. ay i 


I UL 


ttg 


day 


gcg 


LUl 


rrr 

gtc 


aag 


aaa r* V* 
aaa III 


yea ctg 


t*frf~ nrT<~T nart 

tgc ggg gag 




Attt Rot* 


T.pii 
AJCU 


JJCU. 


T .V/C3 


Al a 
/vi a 


ucu 


VaT 
vdl 




xjyss ocx 


& 1 a T . o i i 

Ala Xjcu 


iyt> vjxy vjiu 




ft U 










ft b 








bu 






caa gtg 


Cat 


ate 


ctg 


99 c 


cgc 


gaa 


gtg 


age gag 


gaa gag 


ttt cgt gaa 


ju / 


V3J.I1 vai 


Wi Q 

nxs 


lxc 


Leu 


o±y 


Cys 


uiu 


vaj, 


oci blu 


IjXU KjXU 


jyne Arg oxu 












o u 








bb 




/o 






gac 


ICt 


gat 


afr> 

ate 


aac 




egg 


ctg gtt 


tac cat 


gac ttc ttc 


ice 

j bb 


vj±y irne 


Asp 


Ser 


Asp 


lie 


Asn 


Asn 


Arg 


Lieu vax 


m,rv til' 

xy r hi s 


Asp Jrne jpne 










73 














Q C 

85 




cty d yaL. 


cct 


etc 


aac 


tgg 


+*r*a 


aaa 


apf 
dCL 


gag gag 


gec ttt 


CCT: 999 999 


a n q 

ft UJ 


A 1^/^*1 A ot*» 

-ttX y AHp 


Pro 


Leu. 


Asn 


Trp 


Cor 
DCl 


Lys 


llli 


blU ulU 


ax a rue 


rio oiy ijxy 
























1UU 




ny LLy 


gga 


gec 


teg 


aga 


gec 


aty 


tgc 


aag agg 


dCa gat 


cct gtt cct 


ft b x 


riU licU 


uiy 


a! a 

Ala 


Leu 


Arg 


Ala 




Cys 


Xiys Arg 


xnr Asp 


rio vai i/ro 






lUD 
















lib 






a^i/-» 

yn aCO 


all 


get 


etc 


gac 


iCa 


etc 


age 


igg cty 


cta ctt 


cgc ctt ccc 


A Q Q 

ftyy 


Vdl XttX 


Tl a 

lie 


ax a 


Leu 


Asp 


Ser 


Leu 


Ser 


Trp Leu 


Leu Leu 


Arg Leu Pro 




ion 

XA u 


















i on 
liu 






4- /-y »-» ra f-i <t 


aca 


etc 


tgc 


cag 


gtc 


ctg 


cat 


get gtg 


ayC CaL 


cag gac tct 


bft / 


Ly5 llli 




Leu 


Cys 




Vdl 


Leu 


xlXS 


Ala vax 


t>er xixs 


vjrin Asp ber 




TIC 

Ij b 








IftU 








14b 




150 




tgt cct 


9*9 1 


gac 


age 


tec 


tea 


gtg 


999 


aaa gtg 


agt gtg 


ct 9 99 c tc 9 


roc 

bi?b 




v3j.y 


Asp 


Ser 


Ser 


Ser 


Vet jL 


uiy 


Taj-o "\7aT 

xiys vcix 


oer vax 


lieu Vjriy lieu 










XDO 










lou 




lob 




eta cat 


gaa 


gag 


Clt 


Cat 


gga 


cca 


ggc 


cct gtg 


gga get 


etc age age 




Leu His 


r*l »i 
VjIU 


olU 


Leu 


xilS 


my 


Pro 


vjj.y 


rro vax 


Gly Ala 


Leu Ser Ser 1 








1 7H 

JL / U 










X / o 






10 U 




eft" crr*t* 
w» l- w» y ^* i 


fan 
lay 


au i 


nan 

gay 


a\~a 

y-y 


acc 


ctg 


ggc 


yy i an 


atg ggc 


Clan r~r f f~* fprt 
Cay y H tty 


D71 


T.pu Rl a 
ucu rvxa 


Rl n 

\J-LLL 




din 


Val 




ucu 


Glv 


vjiy -lux 


i v ic i vjxy 


m n A ~\ a Gov* 
<j1H Ala ocX 






185 
















i7 j 






acc cac 


ate 


ctg 


tgt 


caa 
iyy 




ccc 


cga 


pan rrif 


r'r'a art* 
na ai i 


/■jar pacr arh 
yai lay ai*i 


/ -J -7 


Ala His 


lie 


Leu 


Cys 






Pro 




Gin Arg 


Pro Thr 


Sen fll Tl Thr* 






















zlU 






cag tgg 


ttc 


tec 


ate 


ctt 


ccg 


gac 


ttc 


age ctg 


gat etc 


caa gag ggg 


787 


Gin Trp 


Phe 


Ser 


lie 


Leu 


Pro 


Asp 


Phe 


Ser Leu 


Asp Leu 


Gin Glu Gly 




215 








220 








225 




230 




ccc tct 


gta 


gag 


tec 


cag 


ccc 


tac 


tec 


gat cct 


cat ata 


ccc ccg gta 


835 


Pro Ser 


Val 


Glu 


Ser 


Gin 


Pro 


Tyr 


Ser 


Asp Pro 


His lie 


Pro Pro Val 










235 










240 




245 




tct aag 


aat 


gec 


aag 


gec 


aga 


aca 


agg 


aaa tgt 


agt tta 


gta tct ggt 


883 


Ser Lys 


Asn 


Ala 


Lys 


Ala 


Arg 


Thr 


Arg 


Lys Cys 


Ser Leu 


Val Ser Gly 








250 










255 






260 




cac ggg 


aga 


gaa 


aat 


aaa 


age 


tgc 


aga 


ggt tgg 


ggg tgg 


ggt cag gga 


931 


His Gly 


Arg 


Glu 


Asn 


Lys 


Ser 


Cys 


Arg 


Gly Trp 


Gly Trp 


Gly Gin Gly 






265 










270 






275 







ttc tagggatggg gcagagtggc agcatc 960 
Phe 
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<400> 46 






Met 


Thr 


Pro 


Ser 


Glu Gly Ala Arg 


1 








5 


Met 


Leu 


Asp 


Ser 


Leu Leu Ala Leu 








20 




Ser 


Val 


Glu 


Trp 


Glu Gly Arg Ser 






35 




40 


Ser 


Ala 


Leu Cys 


Gly Glu Gin Val 




50 






55 


Glu 


Glu 


Glu 


Phe 


Arg Glu Gly Phe 


65 








70 


Val 


Tyx 


His 


Asp 


Phe Phe Arg Asp 










85 


Glu 


Ala 


Phe 


Pro 


Gly Gly Pro Leu 








100 




Arg 


Thr 


Asp 


Pro 


Val Pro Val Thr 






115 




120 


Leu 


Leu 


Leu 


Arc? 


Leu Pro Cys Thr 




130 






135 


Val 


Ser 


His 


Gin 


Asp Ser Cys Pro 


145 








150 


Val 


Ser 


Val 


Leu 


Gly Leu Leu His 










165 


Val 


Gly 


Ala 


Leu 


Ser Ser Leu Ala 








180 




Thr 


Met. 


Gly 


Gin 


Ala Ser Ala His 






195 




200 


Arg 


Pro 


Thr 


Asp 


Gin Thr Gin Trp 




210 






215 


Leu 


Asp 


Leu 


Gin 


Glu Gly Pro Ser 


225 








230 


Pro 


His 


He 


Pro 


Pro Val Ser Lys 










245 


Cys 


Ser 


Leu 


Val 


Ser Gly His Gly 








260 




Trp 


Gly 


Trp 


Gly 


Gin Gly Phe 



275 



<210> 47 

<211> 1294 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> 5'UTR 
<222> 1. .266 

<220> 
<221> CDS 
<222> 267. .1139 

<220> 

<221> 3'UTR 
<222> 1140. .1294 

<220> 

<221> polyA_signal 
<222> 1246. .1251 

<220> 

<221> polyA_site 
<222> 1279. .1294 



Ala Gly Thr 


Gly Arg Glu 


Leu 


Glu 


10 




15 




Gly Gly Leu 


Val Leu Leu 


Arc? 


Asp 


25 


30 






Leu Leu Lys 


Ala Leu Val 


Lvs 


LVS 




45 






His He Leu 


Gly Cys Glu 


Val 


Ser 




60 






Asp Ser Asp 


He Asn Asn 


Ara 


Leu 


75 






80 


Pro Leu Asn 


Trp Ser Lys 


Thr 


Glu 


90 




95 




Gly Ala Leu 


Arg Ala Met 


Civs 


Lys 


105 


110 






He Ala Leu 


Asd Ser Leu 


Ser 






125 






Thr Leu Cys 


Gin Val Leu 


His 


Ala 




140 






Gly Asp Ser 


Ser Ser Val 


Gly 


T.vC 


155 






160 


Glu Glu Leu 


His Glv Pro 


Glv 


Pro 


170 




175 




Gin Thr Glu 


Val Thr Leu 


Glv 


Glv 


185 


190 






He Leu Cys 


Ara Ara Pro 




Gin 




205 






Phe Ser He 


Leu Pro Asp 


Phe 


Ser 




220 






Val Glu Ser 


Gin Pro Tyr 


Ser 


Asp 


235 






240 


Asn Ala Lys 


Ala Arg Thr 


Arg 


Lys 


250 




255 




Arg Glu Asn 


Lys Ser Cys 


Arg 


Gly 


265 


270 
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<400> 47 

gactctgagg ctccctcttt gctctaacag acagcagcga ctttaggctg gataatagtc 60 
aaattcttac ctcgctcttt cactgctagt aagatcagat tgcgtttctt tcagttactc 120 
ttcaatcgcc agtttcttga tctgcttcta aaagaagaag tagagaagat aaatcctgtc 180 
ttcaatacct ggaaggaaaa acaaaataac ctcaactccg ttttgaaaaa aacattccaa 240 
gaactttcat cagagatttt acttag atg att tac aca atg aag aaa gta cat 293 

Met lie Tyr Thr Met Lys Lys Val His 
-25 -20 
gca ctt tgg get tct gta tgc ctg ctg ctt aat ctt gec cct gec cct 341 
Ala Leu Trp Ala Ser Val Cys Leu Leu Leu Asn Leu Ala Pro Ala Pro 

-15 -10 -5 

ctt aat get gat tct gag gaa gat gaa gaa cac aca att ate aca gat 3 89 
Leu Asn Ala Asp Ser Glu Glu Asp Glu Glu His Thr lie He Thr Asp 

15 10 
acg gag ttg cca cca ctg aaa ctt atg cat tea ttt tgt gca ttc aag 437 
Thr Glu Leu Pro Pro Leu Lys Leu Met His Ser Phe Cys Ala Phe Lys 
15 20 25 30 

gcg gat gat ggc cca tgt aaa gca ate atg aaa aga ttt ttc ttc aat 4 85 
Ala Asp Asp Gly Pro Cys Lys Ala He Met Lys Arg Phe Phe Phe Asn 

35 40 45 

att ttc act cga cag tgc gaa gaa ttt ata tat ggg gga tgt gaa gga 533 
He Phe Thr Arg Gin Cys Glu Glu Phe He Tyr Gly Gly Cys Glu Gly 

50 55 60 

aat cag aat cga ttt gaa agt ctg gaa gag tgc aaa aaa atg tgt aca 581 
Asn Gin Asn Arg Phe Glu Ser Leu Glu Glu Cys Lys Lys Met Cys Thr 

65 70 75 

aga gaa aag cca gat ttc tgc ttt ttg gaa gaa gat cct gga ata tgt 629 
Arg Glu Lys Pro Asp Phe Cys Phe Leu Glu Glu Asp Pro Gly He Cys 

80 85 90 

cga ggt tat att acc agg tat ttt tat aac aat cag aca aaa cag tgt 677 
Arg Gly Tyr He Thr Arg Tyr Phe Tyr Asn Asn Gin Thr Lys Gin Cys 
95 100 105 110 

gaa cgt ttc aag tat ggt gga tgc ctg ggc aat atg aac aat ttt gag 725 
Glu Arg Phe Lys Tyr Gly Gly Cys Leu Gly Asn Met Asn Asn Phe Glu 

115 120 125 

aca ctg gaa gaa tgc aag aac att tgt gaa gat ggt ccg aat ggt ttc 773 
Thr Leu Glu Glu Cys Lys Asn He Cys Glu Asp Gly Pro Asn Gly Phe 

130 135 140 

cag gtg gat aat tat gga acc cag etc aat get gtg aat aac tec ctg 821 
Gin Val Asp Asn Tyr Gly Thr Gin Leu Asn Ala Val Asn Asn Ser Leu 

145 150 155 

act ccg caa tea acc aag gtt ccc age ctt ttt gaa ttt cac ggt ccc 869 
Thr Pro Gin Ser Thr Lys Val Pro Ser Leu Phe Glu Phe His Gly Pro 

160 165 170 

tea tgg tgt etc act cca gca gac aga gga ttg tgt cgt gec aat gag 917 
Ser Trp Cys Leu Thr Pro Ala Asp Arg Gly Leu Cys Arg Ala Asn Glu 
175 180 185 190 

aac aga ttc tac tac aat tea gtc att ggg aaa tgc cgc cca ttt aag 965 
Asn Arg Phe Tyr Tyr Asn Ser Val He Gly Lys Cys Arg Pro Phe Lys 

195 200 205 

tac agt gga tgt ggg gga aat gaa aac aat ttt act tec aaa caa gaa 1013 
Tyr Ser Gly Cys G1 Y G1 Y As n Asn Asn Phe Thr Ser Lys Gin Glu 

210 215 220 

tgt ctg agg gca tgt aaa aaa ggt ttc ate caa aga ata tea aaa gga 1061 
Cys Leu Arg Ala Cys Lys Lys Gly Phe He Gin Arg He Ser Lys Gly 

225 230 235 

ggc eta att aaa acc aaa aga aaa aga aag aag cag aga gtg aaa ata 1109 
Gly Leu He Lys Thr Lys Arg Lys Arg Lys Lys Gin Arg Val Lys He 

240 245 250 

gca tat gaa gaa att ttt gtt aaa aat atg tgaatttgtt atagcaatgt 1159 
Ala Tyr Glu Glu He Phe Val Lys Asn Met 
255 260 
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aacattaatt ctactaaata ttttatatga aatgtttcac tatgattttc tatttttctt 1219 
ctaaaatgct tttaattaat atgttcatta aattttctat gcttattgta cttgttacca 1279 
aaaaaaaaaa aaaaa 1294 

<210> 48 

<211> 291 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SIGNAL 
<222> 1. .28 



<400> 48 



Met 


He 


Tvr 
xyt 


±iii. 


M^4~ TiVO T .ires 
ncL uy □ uyo 


vaJL rllB Aid 


Leu 


Trp 


Ala 


Ser 


val 


Cys 




















-15 




Leu 


Leu 


T.aii 


Asn 


iicU Ala xrXO 


Ala JrrO LieU 


Asn 


Ala 


Asp 


Ser 


Glu 


Glu 






-10 












± 








Asp 


Glu 


Glu 


His 


Thr- T"l*» T1p» 
J. HI lie lie 


ixix Asp inr 


oJLU 


Leu 


Pro 


Pro 


Leu 


Lys 


5 








10 




ij 










2U 


Leu. 


Met 


His 




rnc uyo Hid 


■fixe .bys Aici 


Asp 


Asp 


Gly 


Pro 


Cys 


Lys 










25 












J b 




Ala 


lie 


Met 


jjys 


ZIyvt Dfio DVia 


ir ne Asn iic 


Fne 


Tnr 


Arg 


Gin 


Cys 


Glu 








40 
















Glu 


Phe 


He 




vjjiy uiy uya 


la±U la±y ASn 


o±n 


TV 

Asn 


Arg 


Pne 


Glu 


Ser 






55 




















Leu. 


Glu 


vj x u. 


v.yo 




Pirn rTl\-k -w— 7\ vr** 

vjys inr Arg 


blU 


Lys 


Pro 


Asp 


Tl"U ~ 

Pne 


Cys 




70 






7 5 






on 
OU 








Phe 


Leu 


Glu 


Glu 


Aorv TJlfrt f<lir 

J-L£»JJ rlU Vjiy 


xxe *jys Arg 




Tyr 


lie 


Thr 


Arg 


Tyr 


85 








9(1 














100 


Phe 


Tyr 


Asn 


Asn 


Gin Thr Lys 


Gin Cys Glu 


Arg 


Phe 


Lys 


Tyr 


Gly 


Gly 










105 


110 










115 




Cys 


Leu 


Gly 


Asn 


Met Asn Asn 


Phe Glu Thr 


Leu 


Glu 


Glu 


Cys 


Lys 


Asn 








120 




125 








130 




He 


Cys 


Glu 


Asp 


Gly Pro Asn 


Gly Phe Gin 


Val 


Asp 


Asn 


Tyr 


Gly 


Thr 






135 






140 






145 






Gin 


Leu 


Asn 


Ala 


Val Asn Asn 


Ser Leu Thr 


Pro 


Gin 


Ser 


Thr 


Lys 


Val 




150 






155 






160 








Pro 


Ser 


Leu 


Phe 


Glu Phe His 


Gly Pro Ser 


Trp 


Cys 


Leu 


Thr 


Pro 


Ala 


165 








170 




175 










180 


Asp 


Arg 


Gly 


Leu 


Cys Arg Ala 


Asn Glu Asn 


Arg 


Phe 


Tyr 


Tyr 


Asn 


Ser 










185 


190 










195 




Val 


He 


Gly 


Lys 


Cys Arg Pro 


Phe Lys Tyr 


Ser 


Gly 


Cys 


Gly 


Gly 


Asn 








200 




2 05 








210 






Glu 


Asn 


Asn 


Phe 


Thr Ser Lys 


Gin Glu Cys 


Leu 


Arg 


Ala 


Cys 


Lys 


Lys 






215 






220 






225 








Gly 


Phe 


He 


Gin 


Arg He Ser 


Lys Gly Gly 


Leu 


He 


Lys 


Thr 


Lys 


Arg 




230 






235 






240 










Lys 


Arg 


Lys 


Lys 


Gin Arg Val 


Lys He Ala 


Tyr 


Glu 


Glu 


He 


Phe 


Val 


245 








250 




255 










260 


Lys 


Asn 


Met 





















<210> 49 

<211> 1194 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> 5'UTR 
<222> 1. .47 

<220> 
<221> CDS 
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<222> 48. .1100 
<220> 

<221> 3'UTR 
<222> 1101. .1194 

<220> 

<221> polyA_signal 
<222> 1159. .1164 

<220> 

<221> polyA_site 
<222> 1179. .1194 

<400> 49 

ctcctcagct tcaggcacca ccactgacct gggacagtga atcgaca atg ccg tct 56 



Met Pro Ser 





gtc 


teg 


tgg 


ggc 


acc 


etc 


ctg 


ctg 


gca 


ggc 


ctg 


tgc tgc ctg 


gtc 


104 


Ser 


TT_ 1 

val 


Ser 


Trp 


Gly 


lie 


Leu 


Leu 


Leu 


Ala 


Gly 


Leu 


Cys Cys Leu 


Val 




_ o n 










-15 










-10 






-5 




cc t 


gtc 


tec 


ctg 


ggg 


acc 


aag 


get 


gac 


act 


cac 


gat 


gaa ate ctg 


gag 


152 


Pro 


val 


ser 


Leu 


Gly 
-i 

i. 


Thr 


Lys 


TV T _ 

Ala 


Asp 
5 


Thr 


His 


Asp 


Glu He Leu 


Glu 




yyc 


ctg 


aal 


ttc 


aac 


etc 


acg 


gag 


aut 


ccg 


gag 


get 


10 f 
cag ate cat 


gaa 


200 




Leu 


Asn 


rile 


Asn 


Leu 


mr 


talU 


lie 


Pro 


Glu 


Ala 


Gin He His 


Glu 








ID 










2. 0 










25 






ggc 




cag 


gaa 


etc 


etc 


cgt 


acc 


etc 


aac 


cag 


cca 


gac age cag 


etc 


248 


Gly 




VJlU 




Leu 


Leu 


Arg 


T'h-r- 

xiiiv 


Leu 


Asn 


lar-Ln 


Pro 


Asp Ser Gin 


Leu 
















-3D 










a n 
40 








cag 




acc 


acc 


gg c 


aat 


ggc 


* 

ctg 


LLC 


etc 


age 


gag 


ggc ctg aag 


eta 


296 


Gin 






JL LLL 




Asn. 


vjj.y 


Leu 


r*ne 


Leu 


Ser 


tjrlU 


Gly Leu Lys 


Leu 




45 










en 

3 U 














60 




ata 


gat 


aag 


ttt 


tta 
u i*y 


a^y 




y tt 


aaa 
aaa 


aag 


ctg 


taC 


cac tea gaa 


gee 


O A A 

344 


Val 


7\ dTi 


T.vo 

xjy o 


pVip 
±rXlc 


Leu 


ulU 


Asp 


Val 


Lys 


Lys 


Leu 


Tyr 


his ser Glu 


Ala 












D3 










/ U 






75 






ttc 


act 


ate 

y uv - 


aac 


ttc 


yyy 


gac 


acc 


y da 


gag 


y 


aag 


aaa. Cag atC 


aac 




Phe 


Thr 


Val 


Asn 




Ol v 
uiy 






m n 


ulU 


Ala 


Lys 


Jays vain lie 


Asn 










fin 










or> 








90 






gat 


tac 


gtg 


gaq 


aag 


Qcrt 

33" 


act 


caa 


crcrcr 


aaa 


att 


ata 

3 ^-3 


aat fcto ate 

3 3 ^3 


aag 


440 


Asp 


Tyr 


Val 


Glu 


Lys 


Gly 


Thr 


Gin 


Gly 


Lys 


He 


Val 


Asp Leu Val 


Lys 








95 










100 










105 






gag 


ctt 


gac 


aga 


gac 


aca 


gtt 


ttt 


get 


ctg 


gtg 


aat 


tac ate ttc 


ttt 


488 


Glu 


Leu 


Asp 


Arg 


Asp 


Thr 


Val 


Phe 


Ala 


Leu 


Val 


Asn 


Tyr He Phe 


Phe 






110 










115 










120 






aaa 


ggc 


aaa 


tgg 


gag 


aga 


ccc 


ttt 


gaa 


gtc 


aag 


gac 


acc gag gaa 


gag 


536 


Lys 


Gly 


Lys 


Trp 


Glu 


Arg 


Pro 


Phe 


Glu 


Val 


Lys 


Asp 


Thr Glu Glu 


Glu 




125 










130 










135 






140 




gac 


ttc 


cac 


gtg 


gac 


cag 


gtg 


acc 


acc 


gtg 


aag 


gtg 


cct atg atg 


aag 


584 


Asp 


Phe 


His 


Val 


Asp 


Gin 


Val 


Thr 


Thr 


Val 


Lys 


Val 


Pro Met Met 


Lys 












145 










150 






155 






cgt 


tta 


ggc 


atg 


ttt 


aac 


ate 


cag 


cac 


tgt 


aag 


aag 


ctg tec age 


tgg 


632 


Arg 


Leu 


Gly 


Met 


Phe 


Asn 


lie 


Gin 


His 


Cys 


Lys 


Lys 


Leu Ser Ser 


Trp 










160 










165 








170 






gtg 


ctg 


ctg 


atg 


aaa 


tac 


ctg 


ggc 


aat 


gec 


acc 


gee 


ate ttc ttc 


ctg 


680 


Val 


Leu 


Leu 


Met 


Lys 


Tyr 


Leu 


Gly 


Asn 


Ala 


Thr 


Ala 


He Phe Phe 


Leu 








175 










180 










185 






cct 


gat 


gag 


ggg 


aaa 


eta 


cag 


cac 


ctg 


gaa 


aat 


gaa 


etc acc cac 


gat 


728 


Pro 


Asp 


Glu 


Gly 


Lys 


Leu 


Gin 


His 


Leu 


Glu 


Asn 


Glu 


Leu Thr His 


Asp 






190 










195 










200 








ate 


ate 


acc 


aag 


ttc 


ctg 


gaa 


aat 


gaa 


gac 


aga 


agg 


tct gee age 


tta 


776 


lie 


He 


Thr 


Lys 


Phe 


Leu 


Glu 


Asn 


Glu 


Asp 


Arg 


Arg 


Ser Ala Ser 


Leu 




205 










210 










215 






220 




cat 


tta 


ccc 


aaa 


ctg 


tec 


att 


act 


gga 


acc 


tat 


gat 


ctg aag age 


gtc 


824 
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His 


Leu 


Pro 


LVS 


Leu 


Ser 


He 


Thr 


Gly 


Thr 


Tyr Asp 


Leu 


Lys 


Ser 


Val 












225 










230 










235 






ctg 


ggt 


caa 


ctg 


ggc 


ate 


act 


aag 


gtc 


ttc 


age 


aat 


ggg 


get 


gac 


etc 


872 


Leu 


Gly 


Gin 


Leu 
240 


Gly 


lie 


Thr 


Lvs 


Val 
245 


Phe 


OCX 


nail 


Gly 


Ala 
250 


Asp 


Leu 




tec 


ggg 


qtc 


aca 


qaq 


qaq 

-3 3 


qca 


ccc 


Ctq 

w 3 


aag 






aaa 


gee 


gtg 


cat 


920 


Ser 


Gly 


Val 
255 


Thr 


Glu 


Glu 


Ala 


Pro 
260 


Leu 


Lys 


JJC U 




Lvs 
265 


Ala 


Val 


His 




aag 


get 


gtg 


ctg 


acc 


ate 


gac 


qaq 

3 3 


aaa 


qqq 
333 


act 


gaa 


qct 


qct 


qqq 

333 


qcc 


968 


Lys 


Ala 
270 


Val 


Leu 


Thr 


He 


Asp 
275 


Glu 


Lys 


Gly 


Thr 


Glu 
280 


Ala 


Ala 


Glv 


Ala 




atg 


ttt 


tta 


qaq 


qcc 

ZD 


at a 


ccc 


atq 


tct 


ate 


ccc 


ccc 


gag 


gtc 


aag 


ttc 


1016 


Met 


Phe 


Leu 


Glu 


Ala 


He 


Pro 


Met 


Ser 


He 


Pro 


Pro 


Glu 


Val 


Lys 


IT LLC 




285 










290 










295 








300 




aac 


aaa 


ccc 


ttt 


gtc 


ttc 


tta 


atg 


att 


gac 


caa 


aat 


acc 


aag 


tct 


ccc 


1064 


Asn 


Lys 


Pro 


Phe 


Val 


Phe 


Leu 


Met 


He 


Asp 


Gin Asn 


Thr 


Lys 


Ser 


Pro 












305 










310 










315 






etc 


ttc 


atg 


gga 


aaa 


gtg 


gtg 


aat 


ccc 


acc 


caa 


aaa 


taactgcctc 




1110 


Leu 


Phe 


Met 


Gly 


Lys 


Val 


Val 


Asn 


Pro 


Thr 


Gin Lys 


















320 










325 



















tcgctcctca acccctcccc tccatccctg gccccctccc tggatgacat taaagaaggg 1170 
ttgagctgaa aaaaaaaaaa aaaa 1194 



<210> 50 
<211> 351 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> 1. .24 

<400> 50 



Met 


Pro 


Ser 


Ser 


Val 


Ser 


Trp Gly He Leu Leu Leu Ala Gly Leu Cys 










-20 




-15 -10 


Cys 


Leu 


Val 


Pro 
-5 


Val 


Ser 


Leu Gly Thr Lys Ala Asp Thr His Asp Glu 


He 


Leu 


Glu 


Gly 


Leu 


Asn 


1 5 
Phe Asn Leu Thr Glu He Pro Glu Ala Gin 




10 










15 20 


He 


His 


Glu 


Gly 


Phe 


Gin 


Glu Leu Leu Arg Thr Leu Asn Gin Pro Asp 


25 










30 


35 40 


Ser 


Gin 


Leu 


Gin 


Leu 


Thr 


Thr Gly Asn Gly Leu Phe Leu Ser Glu Gly 










45 




50 55 


Leu 


Lys 


Leu 


Val 


Asp 


Lys 


Phe Leu Glu Asp Val Lys Lys Leu Tyr His 








60 






65 70 


Ser 


Glu 


Ala 


Phe 


Thr 


Val 


Asn Phe Gly Asp Thr Glu Glu Ala Lys Lys 






75 








80 85 


Gin 


He 


Asn 


Asp 


Tyr 


Val 


Glu Lys Gly Thr Gin Gly Lys He Val Asp 




90 










95 100 


Leu 


Val 


Lys 


Glu 


Leu 


Asp 


Arg Asp Thr Val Phe Ala Leu Val Asn Tyr 


105 










110 


115 120 


He 


Phe 


Phe 


Lys 


Gly 


Lys 


Trp Glu Arg Pro Phe Glu Val Lys Asp Thr 










125 




130 135 


Glu 


Glu 


Glu 


Asp 


Phe 


His 


Val Asp Gin Val Thr Thr Val Lys Val Pro 








140 






145 150 


Met 


Met 


Lys 


Arg 


Leu 


Gly 


Met Phe Asn He Gin His Cys Lys Lys Leu 






155 








160 165 


Ser 


Ser 


Trp 


Val 


Leu 


Leu 


Met Lys Tyr Leu Gly Asn Ala Thr Ala He 




170 










175 180 


Phe 


Phe 


Leu 


Pro 


Asp 


Glu 


Gly Lys Leu Gin His Leu Glu Asn Glu Leu 


185 










190 


195 200 


Thr 


His 


Asp 


He 


He 


Thr 


Lys Phe Leu Glu Asn Glu Asp Arg Arg Ser 










205 




210 215 
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Ala 


Ser 


Leu 


His 


Tipu Pro T»vpi TiP»n 

UvU XT -L- \-> XJy O JJC LL 


OCX 


Tl a, 

1J.C 






J. 11X 




Asp 


Leu 








220 




225 










230 






Lvs 


Ser 


Val 


Leu 


Glv Gin Leu Glv 


lie 


Thr 


Lys 


Val 


Phe 


Ser 




oiy 






235 




240 










245 






Ala 


Asp 


lieu 


Ser 


Glv Val Thr Glu 


Glu 


Ala 


Prn 


Leu 




JJCU 


oex 


Lys 




250 






255 








260 








Ala 


Val 


His 


Lys 


Ala Val Leu Thr 


He 


Asp 


Glu 


Lys 


Gly 


Thr 


Glu 


Ala 


265 








270 






275 










280 


Ala 


Gly 


Ala 


Met 


Phe Leu Glu Ala 


He 


Pro 


Met 


Ser 


He 


Pro 


Pro 


Glu 










285 




290 










295 




Val 


Lys 


Phe 


Asn 


Lys Pro Phe Val 


Phe 


Leu 


Met 


He 


Asp 


Gin 


Asn 


Thr 








300 




305 










310 






Lys 


Ser 


Pro 


Leu 


Phe Met Gly Lys 


Val 


Val 


Asn 


Pro 


Thr 


Gin 


Lys 





315 320 325 



<210> 51 

<211> 1317 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> 5'UTR 
<222> 1. .289 

<220> 
<221> CDS 
<222> 290. .1162 

<220> 

<221> 3'UTR 
<222> 1163. .1317 

<220> 

<221> polyA_signal 
<222> 1269. .1274 

<220> 

<221> polyA_site 
<222> 1302.. 1317 

<400> 51 

aactgccagt gatctctgaa gccgactctg aggctccctc tttgctctaa cagacagcag >60 
cgactttagg ctggataata gtcaaattct tacctcgctc tttcactgct agtaagatca 120 
gattgcgttt ctttcagtta ctcttcaatc gccagtttct tgatctgctt ctaaaagaag 180 
aagtagagaa gataaatcct gtcttcaata cctggaagga aaaacaaaat aacctcaact 240 
ccgttttgaa aaaaacattc caagaacttt catcagagat tttacttag atg att tac 298 

Met He Tyr 
-25 



aca 


atg 


aag 


aaa 


gta 


cat 


gca 


ctt 


tgg 


get 


tct 


gta 


tgc 


ctg 


ctg 


ctt 


346 


Thr 


Met 


Lys 


Lys 


Val 


His 


Ala 


Leu 


Trp 


Ala 


Ser 


Val 


Cys 


Leu 


Leu 


Leu 












-20 










-15 










-10 






aat 


ctt 


gec 


cct 


gec 


cct 


ctt 


aat 


get 


gat 


tct 


gag 


gaa 


gat 


gaa 


gaa 


394 


Asn 


Leu 


Ala 


Pro 
-5 


Ala 


Pro 


Leu 


Asn 


Ala 
1 


Asp 


Ser 


Glu 


Glu 
5 


Asp 


Glu 


Glu 




cac 


aca 


att 


ate 


aca 


gat 


acg 


gag 


ttg 


cca 


cca 


ctg 


aaa 


ctt 


atg 


cat 


442 


His 


Thr 


He 


He 


Thr 


Asp 


Thr 


Glu 


Leu 


Pro 


Pro 


Leu 


Lys 


Leu 


Met 


His 






10 










15 










20 










tea 


ttt 


tgt 


gca 


ttc 


aag 


teg 


gat 


gat 


ggc 


cca 


tgt 


aaa 


gca 


ate 


atg 


490 


Ser 


Phe 


Cys 


Ala 


Phe 


Lys 


Ser 


Asp 


Asp 


Gly 


Pro 


Cys 


Lys 


Ala 


He 


Met 




25 










30 










35 










40 




aaa 


aga 


ttt 


ttc 


ttc 


aat 


att 


ttc 


act 


cga 


cag 


tgc 


gaa 


gaa 


ttt 


at a 


538 


Lys 


Arg 


Phe 


Phe 


Phe 


Asn 


He 


Phe 


Thr 


Arg 


Gin 


Cys 


Glu 


Glu 


Phe 


He 












45 










50 










55 







45 
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tat 


ggg 


qqa 


tort 


gaa 


qqa 


aat 


caq 


aat 


cga 


ttt 


gaa 


agt 


ctg 


gaa 


nan 




Tyr Gly 


Glv 


Cys 


Glu 


Glv 


Asn 


Gin 


Asn 




Phe 


Glu 


Ser 


Leu 


Glu 


ox u 










60 










65 










70 








tgc 


aaa 


aaa 


atg 


tgt 


aca 


aga 


gaa 


aaq 


cca 


qat 


ttc 


tqc 


ttt 


ttq 


gaa 


634 


Cys 


Lys 


Lys 
75 


Met 


Cys 


Thr 


Arcr 


Glu 
80 


Lvs 


Pro 


Asp 


Phe 


Cys 
85 


Phe 


Leu 


Glu 




gaa gat 


cct 


gga 


ata 


tgt 


cga 


ggt 


tat 


att 


ace 


agg 


tat 


ttt 


tat 


aac 


682 


Glu Asp 


Pro 


Gly 


He 


Cvs 


Arq 


Glv 


Tvr 


He 


Thr 


Arq 


Tvr 


Phe 


Tvr 


Asn 






90 










95 










100 












aat 


cag 


aca 


aaa 


caq 


tgt 


gaa 


cqt 


ttc 


aag 


tat 


aat 


aaa 

33° 


tgc 


ctg 


aac 

yy^- 


730 


Asn 


Gin 


Thr 


Lvs 


Gin 


Cvs 


Glu 


Ara 


Phe 


Lys 


Tvr 
j 


Glv 


Glv 


Cys 


Leu 


Glv 




105 










110 










115 










120 




aat 


atg 


aac 


aat 


ttt 




aca 


eta 


gaa 


gaa 


tgc 


aag 


aac 


att 


tgt 




i/O 


Asn 


Met 


Asn 


Asn 


Phe 
125 


Glu 


Thr 


Leu 


Glu 


Glu 
130 


Cys 


Lys 


Asn 


He 


i_ys 
135 


ui u 




gat 


ggt 


ccg 


aat 


ggt 


ttc 


cag 


gtg 


gat 


aat 


tat 


gga 


acc 


cag 


etc 


aat 


826 


Asp Gly 


Pro 


Asn 


Glv 


Phe 


Gin 


Val 


Asp 




'Wry 


Gly 






Leu 


Asn 










140 










145 










150 








get 


gtg 


aat 


aac 


tec 


ctcr 


act 


ccg 


caa 


tea 


acc 


way 


afct 




ayL 




ft7A 


Ala Val 


Asn 


Asn 


Ser 


Leu 


Thr 


Pro 


Gin 


Ser 


Thr 


T,vn 


Val 
veil 


■t i. U 




Leu 








155 










160 










165 










ttt 


gaa 


ttt 


cac 


ggt 


ccc 


tea 


tgg 


tgt 


etc 


act 


cca 


gca 


gac 


aga 


gga 


922 


Phe 


Glu 
17 0 


Phe 


His 


Glv 


Pro 


Ser 
175 




Cys 


Leu 


Thr 


nu 
180 


a 




Arg 






ttg 


tgt 


cgt 


gec 


aat 


gag 


aac 


aga 


ttc 


tac 


tac 


aat 


tea 


gtc 


att 


ggg 


970 


Leu 


Cys 


Ara 


Ala 


Asn 


Glu 


Asn 


Ara 


Phe 


j.yx 


fur 


Asn 


Ser 


Val 


-LJ.C 


uiy 




185 










190 










195 










u u 




aaa 


tgc 


cgc 


cca 


ttt 


aaa 


tac 


agt 


aaa. 


tqt 


aaa 
yyy 


aaa 
yy<* 


aat 


gaa 


aac 


aat 


101ft 

IvlO 


Lys 


Cys 


Arq 


Pro 


Phe 
205 


Lys 


Tvr 


Ser 


Glv 


210 


Gly 


Gly 


Asn 


Glu 


215 


Asn 




ttt 


act 


tec 


aaa 


caa 


gaa 


tgt 


ctg 


agg 


gca 


tgt 


aaa 


aaa 


ggt 


ttc 


ate 


1066 


Phe 


Thr 


Ser 


Lys 
220 


Gin 


Glu 


Cys 


JJC u. 


225 




Cys 


Lys 


Lys 


vjjLy 
230 


-trie 


JL-L6 




caa 


aga 


ata 


tea 


aaa 


gga 


ggc 


eta 


att 


aaa 


acc 


aaa 


aga 


aaa 


aga 


aag 


1114 


Gin Arg 


He 


Ser 


Lys 


Gly 


Gly 


Leu 


He 


Lys 


Thr 


Lys 


Arg 


Lys 


Arg 


Lys 








235 










240 










245 










aag 


cag 


aga 


gtg 


aaa 


ata 


gca 


tat 


gaa 


gaa 


att 


ttt 


gtt 


aaa 


aat 


atg 


1162 


Lys 


Gin 
250 


Arg 


Val 


Lys 


lie 


Ala 
255 


Tyr 


Glu 


Glu 


He 


Phe 
260 


Val 


Lys 


Asn 


Met 





tgaatttgtt atagcaatgt aacattaatt ctactaaata ttttatatga aatgtttcac 1222 
tatgattttc tatttttctt etaaaatget tttaattaat atgttcatta aattttctat 1282 
gcttattgta cttgttatca aaaaaaaaaa aaaaa 1317 



<210> 52 
<211> 291 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> 1..28 



<400> 52 

Met He Tyr Thr Met Lys Lys Val His Ala Leu Trp Ala Ser Val Cys 

-25 -20 -15 

Leu Leu Leu Asn Leu Ala Pro Ala Pro Leu Asn Ala Asp Ser Glu Glu 

-10 -5 1 

Asp Glu Glu His Thr He He Thr Asp Thr Glu Leu Pro Pro Leu Lys 
5 10 15 20 

Leu Met His Ser Phe Cys Ala Phe Lys Ser Asp Asp Gly Pro Cys Lys 

25 30 35 

Ala He Met Lys Arg Phe Phe Phe Asn lie Phe Thr Arg Gin Cys Glu 
4 0 45 50 
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pne 


lie 

t; c 

jj 


Tyr* 


Giy 


Giy 


cys 


Glu 
bU 


Gly 


Asn 


Gin 


Asn 


Arg 


Phe 


Glu 


Ser 


T 

Leu 


olU 

70 


uxU 


Cys 


Lys 


Lys 


wee 
75 


cys 


Thr 


Arg 


Glu 


Lys 
80 


Pro 


Asp 


Phe 


Cys 


ir He 


Leu 


rO ii 
GIU 


r*l ii 


Asp 


Pro 


Gly 


lie 


Cys 


Arg 


Gly 


Tyr 


He 


Thr 


Arg 


Tyr 


Oj 










OA 


















100 


xrlie 


Tyr 


Asn 


Asn 


r*l r-t 
bin 

1UD 


inr 


Lys 


Gin 


Cys 


GIU 
11U 


Arg 


pne 


Lys 


Tyr 


Gly 
115 


Gly 




Leu 


Giy 


Asn 




Asn 


Asn 


pne 


G1U 


inr 


Leu 


Glu 


GIU 


Cys 


Lys 


Asn 








120 










125 










130 




Tl #a 


Cys 


nl n 

V7±U. 

135 


Asp 


oiy 


rig 


Asn 


Giy 
140 


pne 


Gin 


val 


ASp 


Asn 
145 


Tyr 


Gly 


Thr 


«in 


Leu 


Asn 


ilia 


Vrtl 


Asn 


Asn 


Ser 


Leu 


Thr 


Pro 


Gin 


Ser 


Thr 


Lys 


Val 




150 










155 










160 








PlTO 


Seir 


Leu 


pne 


G1U 


pne 


HIS 


Gly 


Pro 


Ser 


Trp 


Cys 


Leu 


Thr 


Pro 


Ala 


165 










170 










175 










180 


Asp 


Arg 


Giy 


i»eu 


Cys 


Arg 


Ala 


Asn 


Glu 


Asn 

IS? U 


Arg 


Phe 


Tyr 


Tyr 


Asn 

195 


Ser 


Val 


He 


Gly 


Lys 
200 


Cys 


Arg 


Pro 


Phe 


Lys 
205 


Tyr 


Ser 


Gly 


Cys 


Gly 
210 


Gly 


Asn 


Glu 


Asn 


Asn 
215 


Phe 


Thr 


Ser 


Lys 


Gin 
220 


Glu 


Cys 


Leu 


Arg 


Ala 
225 


Cys 


Lys 


Lys 


Gly 


Phe 
230 


He 


Gin 


Arg 


He 


Ser 
235 


Lys 


Gly 


Gly 


Leu 


He 
240 


Lys 


Thr 


Lys 


Arg 


Lys 


Arg 


Lys 


Lys 


Gin 


Arg 


Val 


Lys 


He 


Ala 


Tyr 


Glu 


Glu 


He 


Phe 


Val 



245 250 255 260 

Lys Asn Met 

<210> 53 
<211> 1907 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> S'OTR 
<222> 1. .1043 

<220> 

<221> CDS 

<222> 1044. .1664 

<220> 

<221> 3'UTR 
<222> 1665. .1907 

<220> 

<221> polyA_signal 
<222> 1869. .1874 

<220> 

<221> polyA_site 
<222> 1892. .1907 

<400> 53 

caaaaaaatt ctaggtcatg atccccataa atgaagagtg atcagtccaa tcccagggaa 60 

cctggacatt ttgggtattg tttcagtgga acatgecttt cataagttcc attttcttgg 120 

gtatctctta ggaagcaagc ataggaaaca ggcccatccg tctgcctgtt ttgcttcctc 180 

atctcacttc tacacgaggg tgcctgtgct caattgetgt tttcccctaa agagactctt 240 

ttccataagt ttgtgaaatg ccatcgacaa acctgatcgc attgeattte actctgetgt 300 

tgagtcgatt tttctttatt ttatcattta gtaactcctt gctctacaga gctttcacct 360 

tccacatatt tcagattcat tctttcctaa actatgtggt ggtctacgtc ctcactgact 420 

tatcaacatg ctaccatcat gcacttccta tctctattcc tcttctttaa atttggttcc 480 

aaatggctca caccattatt ctgagctatt acctgcctac gcagtcctag aaagtaagtg 540 
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attcaggaaa cattccccaa aagtaaagtt tctcaggtaa gatcagaaga ctcccatgag 600 

tcactgctgc tcaggatcac atctggctcc ttgaagagtg attcatcaga ccttacatag 660 

atcttgtcat aaaaatgaaa gaggcctcgg gggaaggtct tgggctggtg gcttctgttg 720 

gagtcctggg ctgtggggtg aaagccgtgg ctgtagagct tcatgcggag ttacttagct 780 

ttgctctcct gtggacaggc catgcctgtg cctcccccaa gcatcggaaa aattggcata 840 

gatgggccct tctcaaaaat cccactcctg gagcactggc caaaattact accatcctga 900 

tgctgggctt gcagtccttt cctttgggaa tatgaacatg gtcaaaatta agtgaacgtg 960 

tctttctggc tttctgtaca atggagcaga acaaagtatc aatttaacta aaatttgaac 1020 

taaatcctct ttccaggttt gga atg cac ttc tgt gga ggc acc ttg ata tec 1073 

Met His Phe CyB Gly Gly Thr Leu He Ser 















1 








5 










10 




cca 


gag 


tgg 


gtg 


ttg 


act 


get 


gec 


cac 


tgc 


ttg 


gag 


aag 


tec 


cca 


agg 


1121 


Pro 


Glu 


Trp 


Val 


Leu 


Thr 


Ala 


Ala 


His 


Cys 


Leu 


Glu 


Lys 


Ser 


Pro 


Arg 












15 










20 










25 






cct 


tea 


tec 


tac 


aag 


gtc 


ate 


ctg 


ggt 


gca 


cac 


caa 


gaa 


gtg 


aat 


etc 


1169 


Pro 


Ser 


Ser 


Tyr 


Lys 


Val 


He 


Leu 


Gly 


Ala 


His 


Gin 


Glu 


Val 


Asn 


Leu 










30 










35 










40 








gaa 


ccg 


cat 


gtt 


cag 


gaa 


ata 


gaa 


gtg 


tct 


agg 


ctg 


ttc 


ttg 


gag 


ccc 


1217 


Glu 


Pro 


His 


Val 


Gin 


Glu 


He 


Glu 


Val 


Ser 


Arg 


Leu 


Phe 


Leu 


Glu 


Pro 








45 










50 










55 










aca 


cga 


aaa 


gat 


att 


gee 


ttg 


eta 


aag 


eta 


age 


agt 


cct 


gee 


gtc 


ate 


1265 


Thr 


Arg 


Lys 


Asp 


He 


Ala 


Leu 


Leu 


Lys 


Leu 


Ser 


Ser 


Pro 


Ala 


Val 


He 






60 










65 










70 












act 


gac 


aaa 


gta 


ate 


cca 


get 


tgt 


ctg 


cca 


tec 


cca 


aat 


tat 


gtg 


gtc 


1313 


Thr 


Asp 


Lys 


Val 


He 


Pro 


Ala 


Cys 


Leu 


Pro 


Ser 


Pro 


Asn 


Tyr 


val 


Val 




75 










80 










85 










90 




get 


gac 


egg 


acc 


gaa 


tgt 


ttc 


ate 


act 


ggc 


tgg 


gga 


gaa 


acc 


caa 


ggt 


1361 


Ala 


Asp 


Arg 


Thr 


Glu 


Cys 


Phe 


lie 


Thr 


Gly 


Trp 


Gly 


Glu 


Thr 


Gin 


Gly 












95 










100 










105 






act 


ttt 


gga 


get 


ggc 


ctt 


etc 


aag 


gaa 


gee 


cag 


etc 


cct 


gtg 


att 


gag 


1409 


Thr 


Phe 


Gly 


Ala 


Gly 


Leu 


Leu 


Lys 


Glu 


Ala 


Gin 


Leu 


Pro 


Val 


He 


Glu 










110 










115 










120 








aat 


aaa 


gtg 


tgc 


aat 


cgc 


tat 


gag 


ttt 


ctg 


aat 


gga 


aga 


gtc 


caa 


tec 


1457 


Asn 


Lys 


Val 


Cys 


Asn 


Arg 


Tyr 


Glu 


Phe 


Leu 


Asn 


Gly 


Arg 


Val 


Gin 


Ser 








125 










130 










135 










acc 


gaa 


etc 


tgt 


get 


ggg 


cat 


ttg 


gec 


gga 


ggc 


act 


gac 


agt 


tgc 


cag 


1505 


Thr 


Glu 


Leu 


Cys 


Ala 


Gly 


His 


Leu 


Ala 


Gly 


Gly 


Thr 


Asp 


Ser 


Cys 


Gin 






14 0 










145 










150 












ggt 


gac 


agt 


gga 


ggt 


cct 


ctg 


gtt 


tgc 


ttc 


gag 


aag 


gac 


aaa 


tac 


att 


1553 


Gly 


Asp 


Ser 


Gly 


Gly 


Pro 


Leu 


Val 


Cys 


Phe 


Glu 


Lys 


Asp 


Lys 


Tyr 


He 




155 










160 










165 










170 




tta 


caa 


gga 


gtc 


act 


tct 


tgg 


ggt 


ctt 


ggc 


tgt 


gca 


cgc 


•ccc 


aat 


aag 


1601 


Leu 


Gin 


Gly 


Val 


Thr 


Ser 


Trp 


Gly 


Leu 


Gly 


Cys 


Ala 


Arg 


Pro 


Asn 


Lys 












175 










180 










185 






cct 


ggt 


gtc 


tat 


gtt 


cgt 


gtt 


tea 


agg 


ttt 


gtt 


act 


tgg 


att 


gag 


gga 


1649 


Pro 


Gly 


Val 


Tyr 


Val 


Arg 


Val 


Ser 


Arg 


Phe 


Val 


Thr 


Trp 


lie 


Glu 


Gly 





190 195 200 

gtg atg aga aat aat taattggacg ggagacagag tgacgcactg actcacctag 1704 
Val Met Arg Asn Asn 
205 

aggctggaac gtgggtaggg atttagcatg ctggaaataa ctggcagtaa tcaaacgaag 1764 
acactgtccc cagctaccag etatgecaaa cctcggcatt ttttgtgtta ttttctgact 1824 
gc tgg at tct gtagtaaggt gacatagcta tgacatttgt taaaaataaa ctctgtactt 1884 
aactttgaaa aaaaaaaaaa aaa 1907 

<210> 54 

<211> 207 

<212> PRT 

<213> Homo sapiens 

<400> 54 

Met His Phe Cys Gly Gly Thr Leu He Ser Pro Glu Trp Val Leu Thr 
1 5 10 15 
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Ala 


Ala 


His 


Cys 
20 


Leu 


Glu Lys Ser 


Pro 
25 


Arg 


Pro 


Ser 


Ser 


Tyr 
30 


Lys 


Val 


xxe 


Leu 


Gly 
35 


Ala 


His 


Gin Glu Val 
40 


Asn 


Leu 


Glu 


Pro 


His 
45 


Val 


Gin 


Glu 


A X& 


GIU 

50 


Val 


Ser 


Arg 


Leu Phe Leu 
55 


Glu 


Pro 


Thr 


Arg 
60 


Lys 


Asp 


He 


Ala 


T on 

i»eu 


Leu 


Lys 


Leu 


Ser 


Ser Pro Ala 


Val 


He 


Thr 


Asp 


Lys 


Val 


He 


Pro 


65 
















75 










80 


Ala 


Cys 


Leu 


Pro 


Ser 


Pro Asn Tyr 


Val 


Val 

o r\ 
90 


7, T _ 

Ala 


Asp 


Arg 


Thr 


Glu 
95 


Cys 


Phe 


lie 


Thr 


Gly 
100 


Trp 


Gly Glu Thr 


Gin 
105 


Gly 


Thr 


Phe 


Gly 


Ala 
110 


Gly 


Leu 


Leu 


Lys 


Glu 

T 1 C 

JLJ.5 


Ala 


Gin 


Leu Pro Val 
120 


He 


Glu 


Asn 


Lys 


Val 
125 


Cys 


Asn 


Arg 


Tyr Glu 


Phe 


Leu 


Asn 


Gly Arg Val 


Gin 


Ser 


Thr 


Glu 


Leu 


Cys 


Ala 


Gly 




130 








135 








140 








His 


Leu 


Ala 


Gly 


Gly 


Thr Asp Ser 


Cys 


Gin 


Gly 


Asp 


Ser 


Gly 


Gly 


Pro 


145 










150 






155 










160 


Leu 


Val 


Cys 


Phe 


Glu 
165 


Lys Asp Lys 


Tyr 


He 
170 


Leu 


Gin 


Gly 


Val 


Thr 
175 


Ser 


Trp Gly 


Leu 


Gly 


Cys 


Ala Arg Pro 


Asn 


Lys 


Pro 


Gly 


Val 


Tyr 


Val 


Arg 








180 






185 










190 






Val 


Ser 


Arg 
195 


Phe 


Val 


Thr Trp He 
200 


Glu 


Gly 


Val 


Met 


Arg 
205 


Asn 


Asn 





<210> 55 

<211> 809 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> 5'UTR 
<222> 1. .25 

<220> 

<221> CDS ■ 
<222> 26. .628 

<220> 

<221> 3'UTR 
<222> 629. .809 

<220> 

<221> polyA_signal 
<222> 766. .771 

<220> 

<221> polyA_site 
<222> 795. .809 

<400> 55 



agaaaggtgt ggttggcatg gggca atg 


Ctt 


gag 


gta 


tea 


gat 


gca 


ctg 


gga 


52 




Met 
1 


Leu 


Glu 


Val 


Ser 
5 


Asp 


Ala 


Leu 


Gly 




gga cct gga aga gta cca 


ggg gec 


aca 


gca 


ggg 


atg 


aat 


gga 


gtg 


gac 


100 


Gly Pro Gly Arg Val Pro 


Gly Ala 


Thr 


Ala 


Gly 


Met 


Asn 


Gly 


Val 


Asp 




10 15 








20 










25 




acg teg ctt etc tgt gat 


ttg ttg 


cag 


gec 


ctg 


acc 


ttc 


ctg 


acc 


aga 


148 


Thr Ser Leu Leu Cys Asp 


Leu Leu 


Gin 


Ala 


Leu 


Thr 


Phe 


Leu 


Thr 


Arg 




30 






35 










40 




aat gaa att ctg tgc ate 


cat gac 


acc 


ttc 


ctg 


aag 


etc 


tgc 


cct 


cct 


196 


Asn Glu He Leu Cys He 


His Asp 


Thr 


Phe 


Leu 


Lys 


Leu 


Cys 


Pro 


Pro 




45 




50 










55 
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ggg aag 


cac 


cac 


aag 


gag 


gca 


acg 


etc 


acc 


atg 


gac 


cag 


gec 


age 


tec 


244 


fll T rn 

oiy JjyS 


Tyr 


Tyr 


Lys 


VjlU. 


Til =» 

Aia 


Thr 

O D 


T All 

lieu 


Thr 


wet 


Asp 


70 


va± 


Ser 


Ser 




ctg cca 


get 


ctg 


egg 


gtc 


aac 


cct 


ttc 


aga 


gac 


cgt 


ate 


tgc 


aga 


gtg 


292 


TiAH 

JJCU riO 


Ala 


Leu 


Arg 


VdJl 


Asn 


Pro 


irne 


Arg 


Asp 


Arg 




Cys 


Arg 


vai 














O v 






















ttc tec 


cac 


aaa 


ggc 


atg 


ttc 


tec 


ttt 


gag 


gat 


gtg 


ctg 


ggc 


atg 


gca 


340 


DVio Car 
xr lit: OCX. 




Lys 


uiy 


Mat* 


DVto 
ir£ie 


Ser 


rile 


n~\ ii 
uiu 


Asp 


val 


Leu 


^jiy 


wet 


Aia 




90 


















i no 










i ft t* 

1UD 




tct gtg 


ttc 


age 


gag 


cag 


gec 


tgc 


cca 


age 


ctg 


aag 


att 


gag 


tat 


gec 


388 


oer vai 


file 


Ser 


uiu 

ii n 
ii u 


ply, 


Aia 


Cys 


Pro 


Ser 
115 


Leu 


Lys 


lie 


rii 

GIU 


Tyr 

n n 
1Z 0 


Ala 




ttt cgc 


ate 


f- a 4- 


gac 


Cut 


aac 


gag 


aat 


ggc 


tec 


arc 


gat 


gag 


gag 


gat 


A ~1 C 


fne Arg 


lie 


Tyr 

IOC 


Asp 


irne 


Asn 




Asn 

1 


Ljiy 


pne 


lie 


Asp 


GlU 

T *3 C 

lib 


Glu 


Asp 




ctg cag 


agg 


acc 


ate 


_ 4_ _ 
ctg 


cga 


ctg 


ctg 


aac 


agt 


gat 


gac 


atg 


tct 


gag 


484 


Leu Gin 


Arg 

ItlU 


lie 


He 


Leu 


Arg 


Leu 


Leu 


Asn 


Ser 


Asp 


Asp 

150 


Met 


Ser 


Glu 




gac etc 


ctg 


atg 


gac 


etc 


acg 


aac 


cac 


gtc 


ctg 


agt 


gag 


teg 


gat 


ctg 


532 


Asp Leu 


Leu 


Met 


Asp 


Leu 


Thr 


Asn 


His 


Val 


Leu 


Ser 


Glu 


Ser 


Asp 


Leu 




155 










160 










165 












gac aat 


gac 


aac 


atg 


ctg 


tec 


ttc 


tea 


gag 


ttt 


gaa 


cat 


gca 


atg 


gec 


580 


Asp Asn 


Asp 


Asn 


Met 


Leu 


Ser 


Phe 


Ser 


Glu 


Phe 


Glu 


His 


Ala 


Met 


Ala 




170 








175 










180 










185 




aag tct 


cca 


gat 


ttc 


atg 


aac 


tec 


ttt 


egg 


att 


cac 


ttc 


tgg 


gga 


tgc 


628 


Lys Ser 


Pro 


Asp 


Phe 
190 


Met 


Asn 


Ser 


Phe 


Arg 
195 


He 


His 


Phe 


Trp 


Gly 
200 


Cys 





tgatgtagcg gcaaatacct gacatggcag cctcgaggga gaccacagga atcgaacccc 688 

ctccagcact ggagggagct ggtttgaagt atgactttgt actgggccca cactcacctc 748 

tagaatattg tttattagat aaaagaaaaa gcttttcctt ageccgaaaa aaaaaaaaaa 808 

t 809 



<210> 56 
<211> 201 
<212> PRT 

<213> Homo sapiens 
<400> 56 



Met 


Leu 


Glu 


Val 


Ser 


Asp 


Ala 


Leu 


Gly 


Gly 


Pro 


Gly 


Arg 


Val 


Pro 


Gly 


1 








5 










10 










15 




Ala 


Thr 


Ala 


Gly 
20 


Met 


Asn 


Gly 


Val 


Asp 
25 


Thr 


Ser 


Leu 


Leu 


Cys 
30 


Asp 


Leu 


Leu 


Gin 


Ala 
35 


Leu 


Thr 


Phe 


Leu 


Thr 
40 


Arg 


Asn 


Glu 


lie 


Leu 
45 


Cys 


He 


His 


Asp 


Thr 
50 


Phe 


Leu 


Lys 


Leu 


Cys 
55 


Pro 


Pro 


Gly 


Lys 


Tyr 
60 


Tyr 


Lys 


Glu 


Ala 


Thr 


Leu 


Thr 


Met 


Asp 


Gin 


Val 


Ser 


Ser 


Leu 


Pro 


Ala 


Leu 


Arg 


Val 


Asn 


65 










70 










75 










80 


Pro 


Phe 


Arg 


Asp 


Arg 
85 


lie 


Cys 


Arg 


Val 


Phe 
90 


Ser 


His 


Lys 


Gly 


Met 
95 


Phe 


Ser 


Phe 


Glu 


Asp 
100 


Val 


Leu 


Gly 


Met 


Ala 
105 


Ser 


Val 


Phe 


Ser 


Glu 
110 


Gin 


Ala 


Cys 


Pro 


Ser 
115 


Leu 


Lys 


He 


Glu 


Tyr 
120 


Ala 


Phe 


Arg 


He 


Tyr 
125 


Asp 


Phe 


Asn 


Glu 


Asn 
130 


Gly 


Phe 


lie 


Asp 


Glu 
135 


Glu 


Asp 


Leu 


Gin 


Arg 
140 


lie 


He 


Leu 


Arg 


Leu 


Leu 


Asn 


Ser 


Asp 


Asp 


Met 


Ser 


Glu 


Asp 


Leu 


Leu 


Met 


Asp 


Leu 


Thr 


145 










150 










155 










160 


Asn 


His 


Val 


Leu 


Ser 


Glu 


Ser 


Asp 


Leu 


Asp 


Asn 


Asp 


Asn 


Met 


Leu 


Ser 










165 










170 








175 




Phe 


Ser 


Glu 


Phe 
180 


Glu 


His 


Ala 


Met 


Ala 
185 


Lys 


Ser 


Pro 


Asp 


Phe 
190 


Met 


Asn 


Ser 


Phe 


Arg 


He 


His 


Phe 


Trp 


Gly 


Cys 
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195 200 

<210> 57 

<211> 1133 

<212> DNA 

<213> Homo Bapiens 

<220> 

<221> 5 f XJTR 
<222> 1. .475 

<220> 
<221> CDS 
<222> 476. .964 

<220> 

<221> 3'UTR 
<222> 965. .1133 

<220> 

<221> polyA_signal 
<222> 1101. .1106 

<220> 

<221> polyA_site 
<222> 1118. .1133 

<400> 57 

gacataatca gagctatgct ggaggagaag agggcagcca tttgctggct ggcttgcagt 60 

gagccaggag gtggcaggac gagttaggag gctggttcag tagctcgggc aagagcaggg 120 

ccccccagga tctgaaggcc tcccaggccc cccaggccca gcgggtccca gaggagagcg 180 

aggaccccaa ggtaactccg gtgagaaggg cgaccaggga tttcaaggcc agccaggctt 240 

tccgggccca ccgggtcccc ctggattccc aggcaaagtt ggatcacctg gcccacctgg 300 

ccctcaagca gagaagggca gcgaagggat tcgaggccca tcaggcctgc ctggctcccc 360 

tgggccaccg ggacctcctg ggattcaggg ccccgccggt ctggatggtt tggatgggaa 420 

ggatggcaag cctggcttga ggggggaccc tggtcctgct ggcccccctg gactc atg 478 

Met 



gga 

Gly 


cca 
Pro 


ccg 
Pro 


ggc 
Gly 
5 


ttt 
Phe 


aag 
Lys 


ggg 

Gly 


aaa 

Lys 


aca 
Thr 
10 


gga 
Gly 


cat 
Hie 


cct 
Pro 


ggc 

Gly 


etc 
Leu 
15 


cca 
Pro 


i 

gga 
Gly 


526 


cct 
Pro 


aag 
Lys 


ggt 

Gly 
20 


gac 
Asp 


tgt 
Cys 


ggc 
Gly 


aaa 
Lys 


cca 
Pro 
25 


ggt 

Gly 


cct 
Pro 


cct 
Pro 


ggc 
Gly 


age 
Ser 
30 


act 
Thr 


ggc 

Gly 


egg 
Arg 


574 


cct 
Pro 


ggc 
Gly 
35 


gca 
Ala 


gag 
Glu 


ggt 

Gly 


gaa 
Glu 


cct 
Pro 
40 


ggt 

Gly 


gec 
Ala 


atg 
Met 


gga 
Gly 


ccc 
Pro 
45 


cag 
Gin 


gga 

Gly 


aga 
Arg 


ccc 
Pro 


622 


ggt 
Gly 
50 


ccc 
Pro 


ccg 
Pro 


gga 
Gly 


cac 
His 


gtt 
Val 
55 


ggg 

Gly 


cca 
Pro 


cca 
Pro 


ggg 

Gly 


cct 
Pro 
60 


cca 
Pro 


ggc 

Gly 


cag 
Gin 


cca 
Pro 


gga 

Gly 
65 


670 


cca 
Pro 


get 
Ala 


ggg 

Gly 


ate 
He 


tct 
Ser 
70 


gca 
Ala 


gtg 

Val 


ggt 
Gly 


ctg 
Leu 


aaa 
Lys 
75 


gga 
Gly 


gac 
Asp 


cga 
Arg 


gga 

Gly 


gec 
Ala 
80 


acc 
Thr 


718 


gga 

Gly 


gaa 
Glu 


agg 
Arg 


ggc 
Gly 
85 


ctt 
Leu 


gca 
Ala 


ggc 

Gly 


etc 
Leu 


cca 
Pro 
90 


ggc 

Gly 


cag 
Gin 


ccc 
Pro 


ggc 

Gly 


ccc 
Pro 
95 


cca 
Pro 


ggt 
Gly 


766 


cct 
Pro 


caa 
Gin 


ggt 

Gly 
100 


cct 
Pro 


cca 
Pro 


ggc 

Gly 


tat 
Tyr 


ggc 

Gly 
105 


aag 
Lys 


atg 
Met 


ggt 

Gly 


gca 
Ala 


aca 
Thr 
110 


gga 
Gly 


cca 
Pro 


atg 
Met 


814 


ggc 
Gly 


cag 
Gin 
115 


caa 
Gin 


ggc 
Gly 


ate 
He 


cct 
Pro 


ggc 

Gly 
120 


ate 
He 


cct 
Pro 


ggg 

Gly 


ccc 
Pro 


ccg 
Pro 
125 


ggt 
Gly 


ccc 
Pro 


atg 
Met 


ggc 

Gly 


862 


cag 


cca 


ggc 


aag 


get 


ggc 


cac 


tgt 


aat 


ccc 


tct 


gac 


tgc 


ttt 


ggg 


gec 


910 
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Gin Pro Gly Lys Ala Gly His Cys Asn Pro Ser Asp Cys Phe Gly Ala 

130 135 140 145 

atg ccg atg gag cag cag tac cca ccc atg aaa acc atg aag ggg cct 958 

Met Pro Met Glu Gin Gin Tyr Pro Pro Met Lys Thr Met Lys Gly Pro 

150 155 * 160 

ttt ggc tgaaattccc cacctgcctt tggatgaaag actccgttgg gaataaatgg 1014 
Phe Gly 

ccaaagctta taggactctg tgacaggttg tgaatgtttt ttttgttgtt gttgttgttt 1074 
ttaattgctg ttaatatttt ttaaataata aagaaacaaa actaaaaaaa aaaaaaaaa 1133 

<210> 58 

<211> 163 

<212> PRT 

<213> Homo sapiens 



<400> 58 



Met 


Gly 


Pro 


Pro 


Gly Phe Lys 


Gly Lys Thr Gly His 


Pro 


Gly 


Leu 


Pro 


1 








5 


10 






15 




Gly 


Pro 


Lys 


Gly 


Asp Cys Gly 


Lys Pro Gly Pro Pro 


Gly 


Ser 


Thr 


Gly 








20 




25 




30 




Arg 


Pro 


Gly 


Ala 


Glu Gly Glu 


Pro Gly Ala Met Gly 


Pro 


Gin 


Gly 


Arg 






35 






40 


45 








Pro 


Gly 


Pro 


Pro 


Gly His Val 


Gly Pro Pro Gly Pro 


Pro 


Gly 


Gin 


Pro 




50 






55 


60 










Gly 


Pro 


Ala 


Gly 


lie Ser Ala 


Val Gly Leu Lys Gly 


Asp 


Arg 


Gly 


Ala 


65 








70 


75 








80 


Thr 


Gly 


Glu 


Arg 


Gly Leu Ala 


Gly Leu Pro Gly Gin 


Pro 


Gly 


Pro 


Pro 










85 


90 






95 




Gly 


Pro 


Gin 


Gly 


Pro Pro Gly 


Tyr Gly Lys Met Gly 


Ala 


Thr 


Gly 


Pro 








100 




105 




110 






Met 


Gly 


Gin 


Gin 


Gly lie Pro 


Gly lie Pro Gly Pro 


Pro 


Gly 


Pro 


Met 






115 






120 


125 








Gly 


Gin 


Pro 


Gly 


Lys Ala Gly 


His Cys Asn Pro Ser 


Asp 


Cys 


Phe 


Gly 




130 






135 


140 








-Ala 


Met 


Pro 


Met 


Glu Gin Gin 


Tyr Pro Pro Met Lys 


Thr 


Met 


Lys 


Gly 


145 








150 


155 








160 


Pro 


Phe 


Gly 

















<210> 59 

<211> 838 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> 5'UTR 
<222> 1. .78 

<220> 
<221> CDS 
<222> 79. .642 

<220> 

<221> 3'UTR 
<222> 643. .838 

<220> 

<221> polyA_signal 
<222> 799. .804 

<220> 

<221> polyA_site 
<222> 823. .838 
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<400> 59 

aaagactgcg tgcagaaggt gactgtctca gtggagctgg gtcatctcag gccttggctc 60 

111 



159 



cttgaacttt tggccgcc 


atg 


tgc 


ttc 


ccg 


aag 


gtc 


etc 


tct 


gat 


gac 


acg 










Met 


Cys 


Phe 


Pro 


Lys 


Val 


Leu 


Ser Asp 


Asp 


Met 










1 








5 










10 




aag aag 


ctg 


aag 


gec 


cga 


atg 


cac 


cag 


gee 


ata 


gaa 


aga 


ttt 


cac 


gat 


Lys Lys 


Leu 


Lys 


Ala 


Arg 


Met 


His 


Gin 


Ala 


lie 


Glu 


Arg Phe 


Tyr 


Asp 






15 










20 










25 






aaa atg 


caa 


aat 


gca 


gaa 


4- ti ~i 

tea 


gga 


cgt 


gga 


cag gtg 


atg teg 


age 


ctg 


Lys Met 


Gin 


Asn 


Ala 


GlU 


Ser 


Gly Arg 


Gly 


Gin Val 


Met 


Ser 


Ser 


Leu 




30 










35 










40 








gca gag 


ctg 


gag 


gac 


gac 


4- Im „ 

ttc 


aaa 


gag 


ggc 


tac 


ctg 


gag 


aca 


gtg 


gcg 


Ala Glu 


Leu 


Glu 


Asp 


Asp 


Phe 


Lys Glu 


Gly 


Tyr Leu 


Glu 


Thr 


Val 


Ala 


45 










50 










55 










get tat 


tat 


gag 


gag 


cag 


cac 


cca 


gag 


etc 


act 


cct 


eta 


ctt 


gaa 


aaa 


Ala Tyr 


Tyr 


Glu 


Glu 


Gin 


His 


Pro 


Glu 


Leu 


Thr 


Pro 


Leu 


Leu 


Glu 


Lys 


60 








65 










70 










75 


gaa aga 


gat 


gga 


tta 


egg 


tgc 


cga 


ggc 


aac 


aga 


tec 


cct 


gtc 


ccg 


gat 


Glu Arg 


Asp 


Gly 


Leu 


Arg 


Cys 


Arg Gly 


Asn 


Arg 


Ser 


Pro Val 


Pro 


Asp 








80 










85 










90 




gtt gag 


gat 


ccc 


gca 


acc 


gag 


gag 


cct 


ggg 


gag 


age 


ttt 


tgt 


gac 


aag 


Val Glu 


Asp 


Pro 
95 


Ala 


Thr 


Glu 


Glu 


Pro 
100 


Gly 


Glu 


Ser 


Phe 


Cys 
105 


Asp 


Lys 


gtc atg 


aga 


tgg 


ttc 


cag 


gee 


atg ctg 


cag 


egg 


ctg 


cag 


acc 


tgg 


tgg 


Val Met 


Arg 


Trp 


Phe 


Gin 


Ala 


Met 


Leu 


Gin 


Arg Leu 


Gin 


Thr 


Trp 


Trp 




110 










115 










120 








cac ggg 


gtt 


ctg 


gec 


tgg 


gtg 


aag 


gag 


aag 


gtg 


gtg 


gec 


ctg 


gtc 


cat 


His Gly 


Val 


Leu 


Ala 


Trp 


vax 


Lys 


Glu 


T •» rn 

Lys 


Val 


Val 


Ala 


Leu 


vax 


J11S 


125 










130 










135 










gca gtg 


cag 


gec 


etc 


tgg 


aaa 


cag 


ttc 


cag 


agt 


ttc 


tgc tgc 


tct 


ctg 


Ala Val 


Gin 


Ala 


Leu 


Trp 


Lys 


Gin 


Phe 


Gin 


Ser 


Phe 


Cys 


Cys 


Ser 


Leu 


14 0 








145 










150 










155 


tea gag 


etc 


ttc 


atg 


tec 


tct 


ttc 


cag 


tec 


tac 


gga 


gee 


cca 


egg 


ggg 


Ser Glu 


Leu 


Phe 


Met 


Ser 


Ser 


Phe 


Gin 


Ser 


Tyr Gly 


Ala 


Pro 


Arg 


Gly 








160 










165 








9 


170 




gac aag 


gag 


gag 


ctg 


aca 


ccc 


cag 


aag 


tgc 


tct 


gaa 


ccc 


caa 


tec 


tea 


Asp Lys 


Glu 


Glu 


Leu 


Thr 


Pro 


Gin Lys 


Cys 


Ser 


Glu 


Pro 


Gin 


Ser 


Ser 



207 



255 



303 



351 



399 



447 



495 



543 



591 



639 

Ser Ser 

175 180 185 

aaa tgaagatact gacaccacct ttgccctccc cgtcaccgcg cacccaccct 692 
Lys 

gacccctccc tcagctgtcc tgtgccccgc cctctcccgc acactcagtc cccctgcctg 752 
gcgttcctgc cgcagctctg acctggtgct gtcgccctgg catcttaata aamcctgett 812 
atacttccct aaaaaaaaaa aaaaaa 838 



<210> 60 

<211> 188 

<212> PRT 

<213> Homo sapiens 



<400> 60 



Met 


Cys Phe 


Pro 


Lys 


Val 


Leu 


Ser Asp 


Asp 


Met 


Lys 


Lys 


Leu 


Lys 


Ala 


1 






5 








10 










15 




Arg 


Met His 


Gin 


Ala 


lie 


Glu 


Arg Phe 


Tyr 


Asp 


Lys 


Met 


Gin 


Asn 


Ala 






20 








25 










30 






Glu 


Ser Gly 


Arg 


Gly 


Gin 


Val 


Met Ser 


Ser 


Leu 


Ala 


Glu 


Leu 


Glu 


Asp 




35 










40 








45 








Asp 


Phe Lys 


Glu 


Gly 


Tyr 


Leu 


Glu Thr 


Val 


Ala 


Ala 


Tyr 


Tyr 


Glu 


Glu 




50 








55 








60 










Gin 


His Pro 


Glu 


Leu 


Thr 


Pro 


Leu Leu 


Glu 


Lys 


Glu 


Arg 


Asp 


Gly 


Leu 


65 








70 








75 










80 


Arg 


Cys Arg 


Gly 


Asn 


Arg 


Ser 


Pro Val 


Pro 


Asp 


Val 


Glu 


Asp 


Pro 


Ala 








85 








90 










95 




Thr 


Glu Glu 


Pro 


Gly 


Glu 


Ser 


Phe Cys 


Asp 


Lys 


Val 


Met 


Arg 


Trp 


Phe 
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100 






Gin Ala 


Met 


T All 

Leu 


cm 


Arg L«eu Gin 




113 






-L jL, W 


Trp Val 


Lys 


Glu 


Lys 


Val Val Ala 


130 








135 


Trp Lys 


Gin 


Phe 


Gin 


Ser Phe Cys 


145 








150 


Ser Ser 


Phe 


Gin 


Ser 


Tyr Gly Ala 








165 




Thr Pro 


Gin 


Lys 


Cys 


Ser Glu Pro 



180 







110 


Thr 


Trp 


Trp His Gly Val Leu Ala 






125 


Leu 


Val 


His Ala Val Gin Ala Leu 






14 0 


Cys 


Ser 


Leu Ser Glu Leu Phe Met 






155 160 


Pro 


Arg 


Gly Asp Lys Glu Glu Leu 




170 


175 


Gin 


Ser 


Ser Lys 


185 







<210> 61 

<211> 862 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> 5 ! UTR 
<222> 1. .158 



<220> 
<221> CDS 
<222> 159. .764 

<220> 

<221> 3'UTR 
<222> 765. .862 



<400> 61 

attttttttt ttggcacgcc tgcagccaag ttggggaggg tttcctggac agaggtcctt 60 
tggctgctgc ettaagaegt gcagcctggg ccgtggctgt cactgcgttc ggacccagac 120 
ccgctgcagg cagcagcagc ccccgcccgc gcagcagc atg gag etc tgg ggg gec 176 

Met Glu Leu Trp Gly Ala 
-20 * -15 



tac 


etc 


etc 


etc 


tgc 


etc 


ttc 


tec 


etc 


ctg 


acc 


cag 


gtc 


acc 


acc 


gag 


224 


Tyr 


Leu 


Leu 


Leu 


Cys 


Leu 


Phe 


Ser 


Leu 


Leu 


Thr 


Gin 


Val 


Thr 


Thr 


Glu 












-10 










-5 










1 






cca 


cca 


acc 


cag 


aag 


ccc 


aag 


aag 


att 


gta 


aat 


gee 


aag 


aaa 


gat 


gtt 


272 


Pro 


Pro 


Thr 


Gin 


Lys 


Pro 


Lys 


Lys 


He 


Val 


Asn 


Ala 


Lys 


Lys 


Asp 


Val 








5 










10 










15 










gtg 


aac 


aca 


aag 


atg 


ttt 


gag 


gag 


etc 


aag 


age 


cgt 


ctg 


gac 


acc 


ctg 


320 


Val 


Asn 


Thr 


Lys 


Met 


Phe 


Glu 


Glu 


Leu 


Lys 


Ser 


Arg 


Leu 


Asp 


Thr 


Leu 






20 










25 










30 












gec 


cag 


gag 


gtg 


gec 


ctg 


ctg 


aag 


gag 


cag 


cag 


gee 


ctg 


cag 


acg 


gtc 


368 


Ala 


Gin 


Glu 


Val 


Ala 


Leu 


Leu 


Lys 


Glu 


Gin 


Gin 


Ala 


Leu 


Gin 


Thr 


Val 




35 










40 










45 










50 




tgc 


ctg 


aag 


ggg 


acc 


aag 


gtg 


cac 


atg 


aaa 


tgc 


ttt 


ctg 


gec 


ttc 


acc 


416 


Cys 


Leu 


Lys 


Gly 


Thr 


Lys 


Val 


His 


Met 


Lys 


Cys 


Phe 


Leu 


Ala 


Phe 


Thr 












55 










60 










65 






cag 


acg 


aag 


acc 


ttc 


cac 


gag 


tec 


age 


gag 


gac 


tgc 


ate 


teg 


cgc 


ggg 


464 


Gin 


Thr 


Lys 


Thr 


Phe 


His 


Glu 


Ser 


Ser 


Glu 


Asp 


Cys 


He 


Ser 


Arg 


Gly 










70 










75 










80 








ggc 


acc 


ctg 


age 


acc 


cct 


cag 


act 


ggc 


teg 


gag 


aac 


gac 


gec 


ctg 


tat 


512 


Gly 


Thr 


Leu 


Ser 


Thr 


Pro 


Gin 


Thr 


Gly 


Ser 


Glu 


Asn 


Asp 


Ala 


Leu 


Tyr 








85 










90 










95 










gag 


tac 


ctg 


cgc 


cag 


age 


gtg 


ggc 


aac 


gag 


gec 


gag 


ate 


tgg 


ctg 


ggc 


560 


Glu 


Tyr 


Leu 


Arg 


Gin 


Ser 


Val 


Gly 


Asn 


Glu 


Ala 


Glu 


He 


Trp 


Leu 


Gly 






100 










105 










110 












etc 


aac 


gac 


atg 


gcg 


gee 


gag 


ggc 


acc 


tgg 


gtg 


gac 


atg 


acc 


ggc 


gee 


608 


Leu 


Asn 


Asp 


Met 


Ala 


Ala 


Glu 


Gly 


Thr 


Trp 


Val 


Asp 


Met 


Thr 


Gly 


Ala 




115 










120 










125 










130 




cgc 


ate 


gee 


tac 


aag 


aac 


tgg 


gag 


act 


gag 


ate 


acc 


gcg 


caa 


ccc 


gat 


656 
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Arg lie Ala Tyr Lys Asn Trp Glu Thr Glu lie Thr Ala Gin Pro Asp 

135 140 145 

9gc ggc aag acc gag aac tgc gcg gtc ctg tea ggc gcg gec aac ggc 704 
Gly Gly Lys Thr Glu Asn Cys Ala Val Leu Ser Gly Ala Ala Asn Gly 

150 155 160 

aag tgg ttc gac aag cgc tgc cgc gat cag ctg ccc tac ate tgc cag 752 
Lys Trp Phe Asp Lys Arg Cys Arg Asp Gin Leu Pro Tyr lie Cys Gin 

165 170 175 

ttc ggg ate gtg tagceggegg ggegggggee gtggggggcc tggaggaggg 804 
Phe Gly lie Val 
180 

caggagccgc gggaggcegg gaggagggtg gggaccttgc agcccccatc ctctccgt 862 

<210> 62 
<211> 202 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> 1. .21 

<400> 62 



Met 


Glu 


Leu 


Trp 


Gly Ala Tyr Leu Leu 


Leu 


Cys 


Leu 


Phe 


Ser Leu 


Leu 




-20 






-15 






-10 








Thr 


Gin 


Val 


Thr 


Thr Glu Pro Pro Thr 


Gin 


Lys 


Pro 


Lys 


Lys He 


Val 


-5 








1 


5 








10 




Asn 


Ala 


Lys 


Lys 


Asp Val Val Asn Thr 


Lys 


Met 


Phe 


Glu 


Glu Leu 


Lys 








15 


20 










25 




Ser 


Arg 


Leu 


Asp 


Thr Leu Ala Gin Glu 


Val 


Ala 


Leu 


Leu 


Lys Glu 


Gin 






30 




35 








40 






Gin 


Ala 


Leu 


Gin 


Thr Val Cys Leu Lys 


Gly 


Thr 


Lys 


Val 


His Met 


Lys 




45 






50 






55 








Cys 


Phe 


Leu 


Ala 


Phe Thr Gin Thr Lys 


Thr 


Phe 


His 


Glu 


Ser Ser 


Glu 


60 








65 




70 








75 


Asp 


Cys 


He 


Ser 


Arg Gly Gly Thr Leu 


Ser 


Thr 


Pro 


Gin 


Thr Gly 


Ser 










80 


85 








90 




Glu 


Asn 


Asp 


Ala 


Leu Tyr Glu Tyr Leu 


Arg 


Gin 


Ser 


Val 


Gly Asn 


Glu 








95 


100 










105 




Ala 


Glu 


He 


Trp 


Leu Gly Leu Asn Asp 


Met 


Ala 


Ala 


Glu 


Gly . Thr 


Trp 






110 




115 








120 






Val 


Asp 


Met 


Thr 


Gly Ala Arg He Ala 


Tyr 


Lys 


Asn 


Trp 


Glu Thr 


Glu 




125 






130 






135 








He 


Thr 


Ala 


Gin 


Pro Asp Gly Gly Lys 


Thr 


Glu 


Asn 


Cys 


Ala Val 


Leu 


140 








145 




150 








155 


Ser 


Gly 


Ala 


Ala 


Asn Gly Lys Trp Phe 


Asp 


Lys 


Arg 


Cys 


Arg Asp 


Gin 










160 


165 








170 




Leu 


Pro 


Tyr 


He 


Cys Gin Phe Gly He 


Val 


















175 


180 















<210> 63 

<211> 618 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> 5'UTR 
<222> 1. .194 

<220> 
<221> CDS 
<222> 195. .587 
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<220> 

<221> 3'UTR 
<222> 588. .618 

<220> 

<221> polyA_signal 
<222> 578. .583 

<220> 

<221> polyA_site 
<222> 604. .618 

<400> 63 

atttgcttag gtctgatcaa tctgctccac acaatttctc agtgatcctc tgcatctctg 60 

cctacaaggg cctccctgac acccaagttc atattgctca gaaacagtga acttgagttt 120 

ttcgttttac cttgatctct ctctgacaaa gaaatccaga tgatgcgaga cctgatgaag 180 



acaatacatg gaaa atg 


aca gtc 


ttg 


gaa 


ata 


act 


ttg 


get 


gtc 


ate 


ctg 


230 






Met 


Thr Val 


Leu 


Glu 


He 


Thr 


Leu 


Ala 


Val 


He 


Leu 










-20 








-15 










-10 




act 


eta 


ctg gga ctt 


gee ate 


ctg 


get 


att 


ttg 


tta 


aca 


aga 


tgg 


gca 


278 


Thr 


Leu 


Leu Gly Leu 

* -5 


Ala lie 


Leu 


Ala 


He 
1 


Leu 


Leu 


Thr 


Arg 


Trp 


Ala 




cga 


cgt 


aag caa agt 


gaa atg 


cat 


ate 


tec 


aga 


tac 


agt 


5 

tea 


gaa 


caa 


326 


Arg 


Arg 


Lys Gin Ser 


Glu Met 


His 


He 


Ser 


Arg 


Tyr 


Ser 


Ser 


Glu 


Gin 








10 




15 










20 










agt 


get 


aga ctt ctg 


gac tat 


gag 


gat 


ggt 


aga 


gga 


tec 


cga 


cat 


gca 


374 


Ser 


Ala 


Arg Leu Leu 


Asp Tyr 


Glu 


Asp 


Gly 


Arg 


Gly 


Ser 


Arg 


His 


Ala 






25 




30 










35 












tat 


tea 


aca caa agt 


gag aga 


tec 


aaa 


aga 


gat 


tac 


aca 


cca 


tea 


ace 


422 


Tyr 


Ser 


Thr Gin Ser 


Glu Arg 


Ser 


Lys 


Arg 


Asp 


Tyr 


Thr 


Pro 


Ser 


Thr 




40 






45 








50 










55 




aac 


tct 


eta gca ctg 


tct cga 


tea 


agt 


att 


get 


tta 


cct 


caa 


gga 


tec 


470 


Asn 


Ser 


Leu Ala Leu 


Ser Arg 


Ser 


Ser 


He 


Ala 


Leu 


Pro 


Gin 


Gly 


Ser 








60 








65 










70 






atg 


agt 


agt ata aaa 


tgt tta 


caa 


aca 


act 


gaa 


gaa 


ctt 


cct 


tec 


aga 


518 


Met 


Ser 


Ser lie Lys 


Cys Leu 


Gin 


Thr 


Thr 


Glu 


Glu 


Leu 


Pro 


Ser 


Arg 








75 






80 










85 








act. 


gca 


gga gee atg 


agt aag 


ttc 


ttt 


ttc 


tgc 


cct 


tta 


att 


etc 


atg 


566 


Thr 


Ala 


Gly Ala Met 


Ser Lys 


Phe 


Phe 


Phe 


Cys 


Pro 


Leu 


He 


Leu 


Met 








90 




95 










100 











tgc ttt get tta eta aac tgt tagaatatgt aagacgaaaa aaaaaaaaaa a 618 
Cys Phe Ala Leu Leu Asn Cys 
105 110 



<210> 64 
<211> 131 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> 1..22 



<400> 64 



Met Thr Val 


Leu 


Glu 


He Thr 


Leu 


Ala Val 


He 


Leu Thr Leu Leu Gly 


-20 








-15 






-10 


Leu Ala He 


Leu 


Ala 


He Leu 


Leu 


Thr Arg 


Trp Ala Arg Arg Lys Gin 


-5 






1 






5 


10 


Ser Glu Met 


His 


He 


Ser Arg 


Tyr 


Ser Ser 


Glu 


Gin Ser Ala Arg Leu 






15 






20 




25 


Leu Asp Tyr 


Glu 


Asp 


Gly Arg 


Gly 


Ser Arg 


His 


Ala Tyr Ser Thr Gin 




30 








35 




40 


Ser Glu Arg 


Ser 


Lys 


Arg Asp 


Tyr 


Thr Pro 


Ser 


Thr Asn Ser Leu Ala 
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45 50 
Leu Ser Arg Ser Ser He Ala Leu 

60 65 
Lys Cys Leu Gin Thr Thr Glu Glu 
75 80 
Met Ser Lys Phe Phe Phe Cys Pro 
95 

Leu Asn Cys 



* 55 

Pro Gin Gly Ser Met Ser Ser lie 
70 

Leu Pro Ser Arg Thr Ala Gly Ala 

85 90 
Leu He Leu Met Cys Phe Ala Leu 
100 105 



<210> 65 

<211> 836 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> 5 r UTR 
<222> 1. .176 

<220> 
<221> CDS 
<222> 177.. 767 



<220> 

<221> 3 1 UTR 
<222> 768. .836 



<220> 

<221> polyA_signal 
<222> 814. .819 



<220> 

<221> polyA_site 
<222> 822. .836 



<400> 65 

aatctgctcc acgcaatttc tcagtgatcc tctgcatctc tgcctacaag ggcctccctg 60 
acacccaagt tcatattgct cagaaacagt gaacttgagt ttttcatttt accttgatct 120 
ctctctgaca aagaaatcca gatgatgcga gacctgatga agacaataca tggaaa atg 179 

Met 



aca 


gtc 


ttg 


gaa 


ata 


act 


ttg 


get 


gtc 


ate 


ctg 


act 


eta 


ctg 


gga 


ctt 


227 


Thr 


val 


Leu 


Glu 


He 


Thr 


Leu 


Ala 


Val 


He 


Leu 


Thr 


Leu 


Leu 


Gly 


Leu 




-2q 










-15 










»10 








-5 




gcc 


ate 


ctg 


get 


att 


ttg 


tta 


aca 


aga 


tgg 


gca 


cga 


cgt 


aag 


caa 


agt 


275 


Ala 


He 


Leu 


Ala 


He 
1 


Leu 


Leu 


Thr 


Arg 
5 


Trp 


Ala 


Arg 


Arg 


Lys 


Gin 


Ser 




gaa 


atg 


tat 


ate 


tec 


aga 


tac 


agt 


tea 


gaa 


caa 


agt 


get 


10 
aga 


ctt 


ctg 


323 


Glu 


Met 


Tyr 


He 


Ser 


Arg 


Tyr 


Ser 


Ser 


Glu 


Gin 


Ser 


Ala 


Arg 


Leu 


Leu 








15 










20 










25 








gac 


tat 


gag 


gat 


ggt 


aga 


gga 


tec 


cga 


cat 


gca 


tat 


tea 


aca 


caa 


agt 


371 


Asp 


Tyr 


Glu 


Asp 


Gly 


Arg 


Gly 


Ser 


Arg 


His 


Ala 


Tyr 


Ser 


Thr 


Gin 


Ser 






30 










35 










40 












gag 


aga 


tec 


aaa 


aga 


gat 


tac 


aca 


cca 


tea 


acc 


aac 


tct 


eta 


gca 


ctg 


419 


Glu 


Arg 


Ser 


Lys 


Arg 


Asp 


Tyr 


Thr 


Pro 


Ser 


Thr 


Asn 


Ser 


Leu 


Ala 


Leu 




45 










50 










55 










60 




tct 


cga 


tea 


agt 


att 


get 


tta 


cct 


caa 


gga 


tec 


atg 


agt 


agt 


ata 


aaa 


467 


Ser 


Arg 


Ser 


Ser 


He 


Ala 


Leu 


Pro 


Gin 


Gly 


Ser 


Met 


Ser 


Ser 


He 


Lys 












65 










70 










75 






tgt 


tta 


caa 


aca 


act 


gaa 


gaa 


cct 


cct 


tec 


aga 


act 


gca 


gga 


gcc 


atg 


515 


Cys 


Leu 


Gin 


Thr 


Thr 


Glu 


Glu 


Pro 


Pro 


Ser 


Arg 


Thr 


Ala 


Gly 


Ala 


Met 










80 










85 










90 








atg 


caa 


ttc 


aca 


gcc 


cct 


att 


ccc 


gga 


get 


aca 


gga 


cct 


ate 


aag 


etc 


563 


Met 


Gin 


Phe 


Thr 


Ala 


Pro 


He 


Pro 


Gly 


Ala 


Thr 


Gly 


Pro 


He 


Lys 


Leu 








95 










100 










105 
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tct 


caa 


aaa 


acc 


att gtg 


caa 


act 


eta 


gga 
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<210> 66 
<211> 197 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 
<222> 1. .22 



<400> 66 

Met Thr Val Leu Glu lie Thr Leu Ala Val lie Leu Thr Leu Leu Gly 

-20 -15 -10 

Leu Ala He Leu Ala He Leu Leu Thr Arg Trp Ala Arg Arg Lys Gin 

-5 1 5 10 

Ser Glu Met Tyr He Ser Arg Tyr Ser Ser Glu Gin Ser Ala Arg Leu 

15 20 25 

Leu Asp Tyr Glu Asp Gly Arg Gly Ser Arg His Ala Tyr Ser Thr Gin 

30 35 40 

Ser Glu Arg Ser Lys Arg Asp Tyr Thr Pro Ser Thr Asn Ser Leu Ala 

45 50 55 

Leu Ser Arg Ser Ser He Ala Leu Pro Gin Gly Ser Met Ser Ser He 

60 65 70 

Lys Cys Leu Gin Thr Thr Glu Glu Pro Pro Ser Arg Thr Ala Gly Ala 
75 80 85 90 

Met Met Gin Phe Thr Ala Pro He Pro Gly Ala Thr Gly Pro He Lys 

95 100 105 

Leu Ser Gin Lys Thr He Val Gin Thr Leu Gly Pro He Val Gin Tyr 

110 115 ' 120 

Pro Gly Ser Asn Gly Arg He Asn He Ser Gin Leu Thr Ser Glu Asp 

125 130 135 

Leu Thr Gly Ala Lys Gly Arg Val Thr Ser Gly Pro Gin Phe Pro Asn 

140 145 150 

Ser His His Val Pro Glu Asn Leu His Gly Tyr Met Asn Ser Leu Ser 
155 160 165 170 

Leu Phe Ser Pro Ala 
175 



<210> 67 

<211> 789 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> 5'UTR 
<222> 1. .62 
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<220> 
<221> CDS 
<222> 63. .572 

<220> 

<221> 3'UTR 
<222> 573. .789 

<220> 

<221> polyA_jsignal 
<222> 750. .755 

<220> 

<221> polyA_site 
<222> 774. .789 

<40O> 67 

atatgtcatc aggccccccg cctgggaggt gtgctgccag agattttgcc tcttcaaggt 60 
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gaggagggac geccagggtg gggaggaaga gtctgeaage agggctgtgg agttagggtt 652 
caccccaatg ggaccaccct cc tgggtccc ctggtgccgt ttttccttag aaatcagaga 712 
aatgggaaag ggggggaaac tgattttaca cttaaataat aaaatcctat tagtaactcc 772 
gaaaaaaaaa aaaaaaa 789 

<210> 68 
<211> 170 
<212> PRT 

<213> Homo sapiens 
<400> 68 

Met Arg Leu Gin Gly Ala He Phe Val Leu Leu Pro His Leu Gly Pro 
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<210> 69 

<211> 2556 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> 5'UTR 
<222> 1. .66 

<220> 
<221> CDS 
<222> 67. .2427 

<220> 

<221> 3'UTR 
<222> 2428. .2556 

<220> 

<221> polyA_signal 
<222> 2522. .2527 

<220> 

<221> polyA_site 
<222> 2541. .2556 

<400> 69 

gtccccgcgt ccctggcaat tcccgacttc ccaacggctt cctgctggca gccccgaagc 60 
cgcacc atg ttc cgc etc tgg ttg ctg ctg gec ggg etc tgc ggc etc 108 

Met Phe Arg Leu Trp Leu Leu Leu Ala Gly Leu Cys Gly Leu 

-15 -10 -5 
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180 185 






190 
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gtg ctg aat gga gag gca cct cct age eta gge'ect tec tct gtg gee 673 
Val Leu Asn Gly Glu Ala Pro Pro Ser Leu Gly Pro Ser Ser Val Ala 

195 200 205 

tec cca gag gac gtc cag gee ctg atg tac ctg aga ggg cag ctg gag 721 
Ser Pro Glu Asp Val Gin Ala Leu Met Tyr Leu Arg Gly Gin Leu Glu 

210 215 220 

cct cag tgg aag atg ttg cag tgc cat cct cac ctg gtg get 763 
Pro Gin Trp Lys Met Leu Gin Cys His Pro His Leu Val Ala 

225 230 235 

tgaaategge caaggtggga gcatttacac cgcagaaatg acaccgcacg ccagcgcccc 823 

gcggccgcga tccggacccc aagcccacgg ctccctcgac tctggggcac ggaaccccgc 883 

ccactcccaa tccccgcgcc ccgccctctc ccacccgtgc ttcccccgct ccacccctca 943 

cctcacctcg cccccgcccc acccatcgcg ccccggcggc tgttattgtt eggctggget 1003 

eggtegggeg ctgtctccct cggctctgcg ggtgtcagtt cgtccggctt cctcacagcc 1063 

cctcactccc ggeggctgae ageagcageg geggeggegg gcggcgcctg gcgtttcgag 1123 

getgagegge accggggttg gggegeggag gaggagcagc agegggagga ggagccgtgt 1183 

gccctggcac tgagcggccg cggccatggc gtacgectat ctcttcaagt acatcataat 1243 

cggcgacaca ggtgttggta aatcatgett attgetacag tttacagaca agaggttcag 1303 

ccagtgcatg accttactat tggtgtagag ttcggtgctc gaatgataac tattgatggg 1363 

aaacagataa aacttcagat atgggatacg gcagggcaag aatcctttcg ttccatcaca 1423 

aggtegtatt acagaggtgc . agcaggagct ttactagttt acgatattac aeggagagat 14 83 

acattcaacc acttgacaac ctggttagaa gatgcccgcc agcattccaa ttccaacatg 1543 

gtcattatgc ttattggaaa taaaagtgat ttagaatcta gaagaaaaaa aaaaagaaaa 1603 



<210> 72 

<211> 252 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 
<222> 1. .17 



<220> 

<221> UNSURE 
<222> 173 

<223> Xaa = Ala, Gly 



<400> 72 



Met 


Gly 


Pro 
-15 


His 


Leu 


His 


Leu 


Cys Leu 
-10 


Cys 


val Pro 


Asp 
-5 


Leu 


Arg* 


Ser 


Leu 


Arg 
1 


Val 


Cys 


Val 


Ser 
5 


Leu 


Trp Ser 


Val 


His His 
10 


Arg 


Pro 


His 


Glu 
15 


Ser 


Leu 


Ala 


Arg 


Glu 
20 


Glu 


Ala 


Leu Thr 


Ala 
25 


Leu Gly 


Lys 


Leu 


Leu 
30 


Tyr 


Leu 


Leu 


Asp 


Gly 
35 


Met 


Leu 


Asp 


Gly Gin 
40 


Val 


Asn Ser 


Gly 


lie 
45 


Ala 


Ala 


Thr 


Pro 


Ala 
50 


Ser 


Ala 


Ala 


Ala 


Ala Thr 
55 


Leu 


Asp Val 


Ala 
60 


Val 


Arg 


Arg 


Gly 


Leu 
65 


Ser 


His 


Ala 


Ala 


Gin 
70 


Arg Leu 


Leu 


Cys Val 
75 


Ala 


Leu 


Gly 


Gin 


Leu 


Asp 


Arg 


Pro 


Pro 


Asp 


Leu 


Ala His 


Asp 


Gly Arg 


Ser 


Leu 


Trp 


Leu 


80 










85 








90 








95 


Asn 


He 


Arg 


Gly 


Lys 
100 


Glu 


Ala 


Ala Ala 


Leu 
105 


Ser Met 


Phe 


His 


Val 
110 


Ser 


Thr 


Pro 


Leu 


Pro 
115 


Val 


Met 


Thr 


Gly Gly 
120 


Phe 


Leu Ser 


Cys 


He 
125 


Leu 


Gly 


Leu 


Val 


Leu 
130 


Pro 


Leu 


Ala 


Tyr 


Gly Phe 
135 


Gin 


Pro Asp 


Leu 
140 


Val 


Leu 


Val 


Ala 


Leu 
145 


Gly 


Pro 


Gly 


His 


Gly 
150 


Leu Gin 


Gly 


Pro His 
155 


Xaa 


Ala 


Leu 


Leu 


Ala 


Ala 


Met 


Leu 


Arg 


Gly 


Leu 


Ala Gly 


Gly 


Arg Val 


Leu 


Ala 


Leu 


Leu 


160 










165 






170 








175 
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Glu Glu Asn Ser Thr Pro Gin Leu Ala Gly lie Leu Ala Arg Val Leu 

180 185 190 

Asn Gly Glu Ala Pro Pro Ser Leu Gly Pro Ser Ser Val Ala Ser Pro 

195 200 205 

Glu Asp Val Gin Ala Leu Met Tyr Leu Arg Gly Gin Leu Glu Pro Gin 

210 215 220 

Trp Lys Met Leu Gin Cys His Pro His Leu Val Ala 
225 230 235 

<210> 73 

<211> 879 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> 5 'UTR 
<222> 1. .8 

<220> 
<221> CDS 
<222> 9. .395 

<220> 

<221> 3 'UTR 
<222> 396. .879 

<220> 

<221> polyA_site 
<222> 864. .879 

<400> 73 

aggccaac atg gcc gtg ctg ctg ctg ctg etc cgt gec etc cgc egg ggt 50 
Met Ala Val Leu Leu Leu Leu Leu Arg Ala Leu Arg Arg Gly 







-15 








-10 








-5 










cca 


ggc 


ccg ggt 


cct 


egg 


ccg 


ctg 


tgg 


ggc 


cca 


ggc 


ccg 


gcc 


tgg 


agt 


98 


Pro 


Gly 
1 


Pro Gly 


Pro 


Arg 
5 


Pro 


Leu 


Trp 


Gly 


Pro 
10 


Gly 


Pro 


Ala 


Trp 


Ser 
15 




cca 


ggg 


ttc ccc 


gcc 


agg 


ccc 


ggg 


agg 


ggg 


egg 


ccg 


tac 


atg 


gcc 


age 


146 


Pro 


Gly 


Phe Pro 


Ala 
20 


Arg 


Pro 


Gly 


Arg 


Gly 
25 


Arg 


Pro 


Tyr 


Met 


Ala 
30 


Ser 




agg 


cct 


ccg ggg 


gac 


etc 


gcc 


gag 


get 


gga 


ggc 


cga 


get 


ctg 


cag 


age 


194 


Arg 


Pro 


Pro Gly 
35 


Asp 


Leu 


Ala 


Glu 


Ala 
40 


Gly 


Gly 


Arg 


Ala 


Leu 
45 


Gin 


Ser 




tta 


caa 


ttg, aga 


ctg 


eta 


acc 


cct 


acc 


ttt 


gaa 


ggg 


ate 


aac 


gga 


ttg 


242 


Leu 


Gin 


Leu Arg 
50 


Leu 


Leu 


Thr 


Pro 
55 


Thr 


Phe 


Glu 


Gly 


He 
60 


Asn 


Gly 


Leu 




ttg. 


ttg 


aaa caa 


cat 


tta 


gtt 


cag 


aat 


cca 


gtc 


aga 


etc 


tgg 


caa 


ctt 


290 


Leu 


Leu 
65 


Lys Gin 


His 


Leu 


Val 
70 


Gin 


Asn 


Pro 


Val 


Arg 
75 


Leu 


Trp 


Gin 


Leu 




tta 


ggt 


ggt act 


ttc 


tat 


ttt 


aac 


acc 


tea 


agg 


ttg 


aag 


cag 


aag 


aat 


338 


Leu 


Gly 


Gly Thr 


Phe 


Tyr 


Phe 


Asn 


Thr 


Ser 


Arg 


Leu 


Lys 


Gin 


Lys 


Asn 




80 








85 










90 










95 




aag 


gag 


aag gat 


aag 


teg 


aag 


ggg 


aag 


gcg 


cct 


gaa 


gag 


gac 


gaa 


ggt 


386 


Lys 


Glu 


Lys Asp 


Lys 
100 


Ser 


Lys 


Gly 


Lys 


Ala 
105 


Pro 


Glu 


Glu 


Asp 


Glu 
110 


Gly 





ata ttc ate tgatgttctt cagtcagtag ctgcctctgg atgtctttac 435 
He Phe He 

rtttctgttt weettttage aaggtgaaac cagtctggam aatggggaga tgggccgggt 495 

gcagtggctc acacttgtaa tegaaacget ttgggaggcc caggtggaag gatcacttga 555 

ggcctatacc acatagctag accctgtctc actgeaaatt aaaaggctgg gcgtggtggc 615 

tcacacctgt aatcccagca ctttgggagg ctgaggcagg cggatcacct gcaccctggc 675 

caacatggtg aaaccccgtc tttactaaaa atagaaaatt ageegggegt gatggcacac 735 

gectgtaate ccagctactc gggaggctga ggcaggagaa ttgettgaac ctgggaggtg 795 
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gaggttgctg tgagtggaga tcatgccatt gcactccagc ctgagcaaca agagcaaaac 855 
tccatcccaa aaaaaaaaaa aaaa 879 

<210> 74 
<211> 129 
<212> PRT 

<;213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> 1..16 

<400> 74 

Met Ala Val Leu Leu Leu Leu Leu Arg Ala Leu Arg Arg Gly Pro Gly 

-15 -10 -5 

Pro Gly Pro Arg Pro Leu Trp Gly Pro Gly Pro Ala Trp Ser Pro Gly 
15 10 15 

Phe Pro Ala Arg Pro Gly Arg Gly Arg Pro Tyr Met Ala Ser Arg Pro 

20 25 30 

Pro Gly Asp Leu Ala Glu Ala Gly Gly Arg Ala Leu Gin Ser Leu Gin 

35 40 45 

Leu Arg Leu Leu Thr Pro Thr Phe Glu Gly He Asn Gly Leu Leu Leu 

50 55 60 

Lys Gin His Leu Val Gin Asn Pro Val Arg Leu Trp Gin Leu Leu Gly 
65 70 75 80 

Gly Thr Phe Tyr Phe Asn Thr Ser Arg Leu Lys Gin Lys Asn Lys Glu 

85 90 95 

Lys Asp Lys Ser Lys Gly Lys Ala Pro Glu Glu Asp Glu Gly He Phe 
100 105 110 

He 

<210> 75 
<211> 1634 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> 5 ! UTR 
<222> 1. .87 

<220> 
<221> CDS 
<222> 88. .1269 

<220> 

<221> 3'UTR 
<222> 1270. .1634 

<220> 

<221> polyA_ signal 
<222> 1594. .1599 

<220> 

<221> polyA_site 
<222> 1619. .1634 

<400> 75 

aaagttcctc agcccttggc tcctgcccag tgtttagggt gttggcggag acaaagggga 60 
agagtcatcg cctgtcgggg ctaggat atg atg ggt gtg ttt gta gtt get get 114 

Met Met Gly Val Phe Val Val Ala Ala 

1 5 

aag cga acg ccc ttt gga get tac gga ggc ctt ctg aaa gac ttc act 162 
Lys Arg Thr Pro Phe Gly Ala Tyr Gly Gly Leu Leu Lys Asp Phe Thr 
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10 15 20 25 

get act gac ttg tct gaa ttt get gec aag get gec ttg tct get ggc 210 

Ala Thr Asp Leu Ser Glu Phe Ala Ala Lys Ala Ala Leu Ser Ala Gly 

30 35 40 

aaa gtc tea cct gaa aca gtt gac agt gtg att atg ggc aat gtc ctg 258 
Lys Val Ser Pro Glu Thr Val Asp Ser Val He Met Gly Asn Val Leu 

45 50 55 

cag agt tct tea gat get ata tat ttg gca agg cat gtt ggt ttg cgt 306 
Gin Ser Ser Ser Asp Ala He Tyr Leu Ala Arg His Val Gly Leu Arg 

SO 65 70 

gtg gga ate cca aag gag acc cca get etc acg att aat agg etc tgt 354 
Val Gly He Pro Lys Glu Thr Pro Ala Leu Thr He Asn Arg Leu Cys 

75 80 85 

ggt tct ggt ttt cag tec att gtg aat gga tgt cag gaa att tgt gtt 402 
Gly Ser Gly Phe Gin Ser He Val Asn Gly Cys Gin Glu He Cys Val 
90 95 100 105 

aaa gaa get gaa gtt gtt tta tgt gga gga acc gaa age atg age caa 450 
Lys Glu Ala Glu Val Val Leu Cys Gly Gly Thr Glu Ser Met Ser Gin 

110 115 120 

get ccc tac tgt gtc aga aat gtg cgt ttt gga acc aag ctt gga tea 498 
Ala Pro Tyr Cys Val Arg Asn Val Arg Phe Gly Thr Lys Leu Gly Ser 

125 130 135 

gat ate aag ctg gaa gat tct tta tgg gta tea tta aca gat cag cat 546 
Asp He Lys Leu Glu Asp Ser Leu Trp Val Ser Leu Thr Asp Gin His 

140 145 150 

gtc cag etc ccc atg gca atg act gca gag aat ctt get gta aaa cac 594 
Val Gin Leu Pro Met Ala Met Thr Ala Glu Asn Leu Ala Val Lys His • 

155 160 165 

aaa ata age aga gaa gaa tgt gac aaa tat gee ctg cag tea cag cag 642 
Lys He Ser Arg Glu Glu Cys Asp Lys Tyr Ala Leu Gin Ser Gin Gin 
170 175 180 185 

aga tgg aaa get get aat gat get ggc tac ttt aat gat gaa atg gca 690 
Arg Trp Lys Ala Ala Asn Asp Ala Gly Tyr Phe Asn Asp Glu Met Ala 

190 195 200 

cca att gaa gtg aag aca aag aaa gga aaa cag aca atg cag gta gac 738 
Pro He Glu Val Lys Thr Lys Lys Gly Lys Gin Thr Met Gin Val Asp 

205 210 215 

gag cat get egg ccc caa acc acc ctg gaa cag tta cag aaa ctt cct 786 
Glu His Ala Arg Pro Gin Thr Thr Leu Glu Gin Leu Gin Lys Leu Pro 

220 225 230 

cca gta ttc aag aaa gat gga act gtt act gca ggg aat gca teg ggt 834 
Pro Val Phe Lys Lys Asp Gly Thr Val Thr Ala Gly Asn Ala Ser Gly 

235 240 245 

gta get gat ggt get gga get gtt ate ata get agt gaa gat get gtt 882 
Val Ala Asp Gly Ala Gly Ala Val He He Ala Ser Glu Asp Ala Val 
250 255 260 265 

aag aaa cat aac ttc aca cca ctg gca aga att gtg ggc tac ttt gta 930 
Lys Lys His Asn Phe Thr Pro Leu Ala Arg He Val Gly Tyr Phe Val 

270 275 280 

tct gga tgt gat ccc tct ate atg ggt att ggt cct gtc cct get ate 978 
Ser Gly Cys Asp Pro Ser He Met Gly He Gly Pro Val Pro Ala He 

285 290 295 

agt ggg gca ctg aag aaa gca gga ctg agt ctt aag gac atg gat ttg 1026 
Ser Gly Ala Leu Lys Lys Ala Gly Leu Ser Leu Lys Asp Met Asp Leu 

300 305 310 

gta gag gtg aat gaa get ttt get ccc cag tac ttg get gtt gag agg 1074 
Val Glu Val Asn Glu Ala Phe Ala Pro Gin Tyr Leu Ala Val Glu Arg 

315 320 325 

agt ttg gat ctt gac ata agt aaa acc aat gtg aat gga gga gee att 1122 
Ser Leu Asp Leu Asp He Ser Lys Thr Asn Val Asn Gly Gly Ala He 
330 335 340 345 

get ttg ggt cac cca ctg gga gga tct gga tea aga att act gca cac 1170 
Ala Leu Gly His Pro Leu Gly Gly Ser Gly Ser Arg He Thr Ala His 
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350 355 360 

ctg gtt cac gaa tta agg cgt cga ggt gga aaa tat gcc gtt gga tea 1218 
Leu Val His Glu Leu Arg Arg Arg Gly Gly Lys Tyr Ala Val Gly Ser 

365 370 375 

get tgc att gga ggt ggc caa ggt att get gtc ate att cag age aca 1266 
Ala Cys He Gly Gly Gly Gin Gly He Ala Val He He Gin Ser Thr 

380 385 390 

gcc tgaagagacc agtgagctca ctgtgaccca tccttactct acttggccag 1319 
Ala 

gecacagtaa aacaagtgac cttcagagca gctgccacaa ctggccatgc cctgccattg 1379 
aaacagtgat taagtttgat caagecatgg tgacacaaaa atgeattgat catgaatagg 1439 
agcccatgct agaagtacat tctctcagat ttgaaccagt gaaatatgat gtatttctga 1499 
gctaaaactc aactatagaa gacattaaaa gaaategtat tettgecaag taaccaccac 1559 
ttctgectta gataatatga ttataaggaa atcaaataaa tgttgcctta acttcaaaca 1619 
aaaaaaaaaa aaaaa 1634 

<210> 76 

<211> 394 

<212> PRT 

<213> Homo sapiens 



<400> 76 



Met 


Met 


Glv 


val 


it lie; 


V CtJL 


V d J. 


A1 a 


A 7 a 


Lys 


Arg 




Pro 


file 


oj.y 


Til a 


1 








5 










10 














Tvr 


Glv 


Glv 


Leu 


Leu 


TiVQ 

JJjr O 


AD ±J 


IT 11 C 


Thr 


AT a 


lil-L 






Ser 




Jr He 








20 










25 










30 






Ala 


Ala 


Lys 


Ala 


Ala 


Leu 


Ser 


Ala 


Glv 


Lys 


Val 


Ser 


Pro 


Glu 


Thr 


v ct x 






35 










40 










45 








ASD 


Ser 


Val 


He 


Met 


Glv 


Asn 


Val 


Leu 


Gin 


Ser 


Ser 


Ser 


Asp 


Ala 


Tie* 

IXC 




50 










55 










60 










TVX 


Leu 


Ala 


Ara 


His 


Val 


Glv 


Leu 


Arg 


Val 


Glv 


He 


Pro 


Lys 


Glu 


Thr 


65 










70 










75 










80 


Pro 


Ala 


Leu 


Thr 


He 


Asn 


Ara 


Leu 


Cys 


Glv 


Ser 


Glv 


Phe 


Gin 


Ser 


He 










85 










90 










95 




Val 


Asn 


Gly 


Cys 


Gin 


Glu 


He 


Cys 


Val 


Lys 


Glu 


Ala 


Glu 


Val 


Val 


Leu 








100 










105 










110 






Cys 


Gly 


Gly 


Thr 


Glu 


Ser 


Met 


Ser 


Gin 


Ala 


Pro 


Tyr 


Cys 


Val 


Arg 


Asn 






115 










120 










125 








Val 


Arg 


Phe 


Gly 


Thr 


Lys 


Leu 


Gly 


Ser 


Asp 


He 


Lys 


Leu 


Glu 


Asp 


Ser 




130 










135 










140 










Leu 


Trp 


Val 


Ser 


Leu 


Thr 


Asp 


Gin 


His 


Val 


Gin 


Leu 


Pro 


Met 


Ala 


Met 


145 










150 










155 










160 


Thr 


Ala 


Glu 


Asn 


Leu 


Ala 


Val 


Lys 


His 


Lys 


He 


Ser 


Arg 


Glu 


Glu 


Cys 










165 










170 










175 




Asp 


Lys 


Tyr 


Ala 


Leu 


Gin 


Ser 


Gin 


Gin 


Arg 


Trp 


Lys 


Ala 


Ala 


Asn 


Asp 








180 










185 










190 






Ala 


Gly 


Tyr 


Phe 


Asn 


Asp 


Glu 


Met 


Ala 


Pro 


He 


Glu 


Val 


Lys 


Thr 


Lys 






195 










200 










205 








Lys 


Gly 


Lys 


Gin 


Thr 


Met 


Gin 


Val 


Asp 


Glu 


His 


Ala 


Arg 


Pro 


Gin 


Thr 




210 










215 










220 










Thr 


Leu 


Glu 


Gin 


Leu 


Gin 


Lys 


Leu 


Pro 


Pro 


val 


Phe 


Lys 


Lys 


Asp 


Gly 


225 










230 










235 










240 


Thr 


Val 


Thr 


Ala 


Gly 


Asn 


Ala 


Ser 


Gly 


Val 


Ala 


Asp 


Gly 


Ala 


Gly 


Ala 










245 










250 










255 




Val 


He 


He 


Ala 


Ser 


Glu 


Asp 


Ala 


Val 


Lys 


Lys 


His 


Asn 


Phe 


Thr 


Pro 








260 










265 










270 






Leu 


Ala 


Arg 


lie 


Val 


Gly 


Tyr 


Phe 


Val 


Ser 


Gly 


Cys 


Asp 


Pro 


Ser 


He 






275 










280 










285 








Met 


Gly 


He 


Gly 


Pro 


Val 


Pro 


Ala 


He 


Ser 


Gly 


Ala 


Leu 


Lys 


Lys 


Ala 




290 










295 










300 










Gly 


Leu 


Ser 


Leu 


Lys 


Asp 


Met 


Asp 


Leu 


Val 


Glu 


Val 


Asn 


Glu 


Ala 


Phe 


305 










310 










315 










320 


Ala 


Pro 


Gin 


Tyr 


Leu 


Ala 


Val 


Glu 


Arg 


Ser 


Leu Asp 


Leu 


Asp 


He 


Ser 
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325 330 335 

Lys Thr Asn Val Asn Gly Gly Ala lie Ala Leu Gly His Pro Leu Gly 

340 345 350 

Gly Ser Gly Ser Arg He Thr Ala His Leu Val His Glu Leu Arg Arg 

355 360 365 

Arg Gly Gly Lys Tyr Ala Val Gly Ser Ala Cys He Gly Gly Gly Gin 

370 375 380 

Gly He Ala Val He He Gin Ser Thr Ala 
385 390 

<210> 77 
<211> 1642 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> 5»UTR 
<222> 1..68 

<220> 
<221> CDS 
<222> 69. .875 

<220> 

<221> 3'UTR 
<222> 876. .1642 

<220> 

<221> polyA_signal 
<222> 1599. .1604 

<220> 

<221> polyA_site 
<222> 1627. .1642 

<400> 77 

attttatagc ggccgcgggc ggcggcggca gcggttggag gttgtaggac cggcgaggaa 60 
taggaatc atg gcg get gcg ctg ttc gtg ctg ctg gga ttc gcg ctg ctg 110 

Met Ala Ala Ala Leu Phe Val Leu Leu Gly Phe Ala Leu Leu 

-20 -15 -10 



ggc 


acc 


cac 


gga 


gec 


tec 


ggg 


get 


gec 


ggc 


aca 


gtc 


ttc 


act 


acc 


gta . 


158 


Gly 


Thr 


His 


Gly 


Ala 


Ser 


Gly 


Ala 


Ala 


Gly 


Thr 


Val 


Phe 


Thr 


Thr 


Val 






-5 










1 








5 










10 




gaa 


gac 


ctt 


ggc 


tec 


aag 


ata 


etc 


etc 


acc 


tgc 


tec 


ttg 


aat 


gac 


age 


206 


Glu 


Asp 


Leu 


Gly 


Ser 


LyB 


He 


Leu 


Leu 


Thr 


Cys 


Ser 


Leu 


Asn 


Asp 


Ser 












15 










20 










25 






gec 


aca 


gag 


gtc 


aca 


ggg 


cac 


cgc 


tgg 


ctg 


aag 


ggg 


ggc 


gtg 


gtg 


ctg 


254 


Ala 


Thr 


Glu 


Val 


Thr 


Gly 


His 


Arg 


Trp 


Leu 


Lys 


Gly 


Gly 


Val 


Val 


Leu 










30 










35 










40 








aag 


gag 


gac 


gcg 


ctg 


ccc 


ggc 


cag 


aaa 


acg 


gag 


ttc 


aag 


gtg 


gac 


tec 


302 


Lys 


Glu 


Asp 


Ala 


Leu 


Pro 


Gly 


Gin 


Lys 


Thr 


Glu 


Phe 


Lys 


Val 


Asp 


Ser 








45 










50 










55 










gac 


gac 


cag 


tgg 


gga 


gag 


tac 


tec 


tgc 


gtc 


ttc 


etc 


ccc 


gag 


ccc 


atg 


350 


Asp 


Asp 


Gin 


Trp 


Gly 


Glu 


Tyr 


Ser 


Cys 


Val 


Phe 


Leu 


Pro 


Glu 


Pro 


Met 






60 










65 










70 












ggc 


acg 


gee 


aac 


ate 


cag 


etc 


cac 


ggg 


cct 


ccc 


aga 


gtg 


aag 


gee 


gtg 


398 


Gly 


Thr 


Ala 


Asn 


He 


Gin 


Leu 


His 


Gly 


Pro 


Pro 


Arg 


Val 


Lys 


Ala 


Val 




75 










80 










85 










90 




aag 


teg 


tea 


gaa 


cac 


ate 


aac 


gag 


ggg 


gag 


acg 


gec 


atg 


ctg 


gtc 


tgc 


446 


Lys 


Ser 


Ser 


Glu 


His 


He 


Asn 


Glu 


Gly 


Glu 


Thr 


Ala 


Met 


Leu 


Val 


Cys 












95 










100 










105 






aag 


tea 


gag 


tec 


gtg 


cca 


cct 


gtc 


act 


gac 


tgg 


gec 


tgg 


tac 


aag 


ate 


494 


Lys 


Ser 


Glu 


Ser 


Val 


Pro 


Pro 


val 


Thr 


Asp 


Trp 


Ala 


Trp 


Tyr 


Lys 


lie 
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110 115 120 



act 


gac 


tct 


gag 


gac 


aag 


gcc 


etc 


atg 


aac 


ggc 


tec 


gag 


age 


agg 


ttc 


542 


rnr 


Asp 


Ser 


Glu 


Asp 


Lys 


Ala 


Leu 


Met 


Asn 


Gly 


Ser 


Glu 


Ser 


Arg 


Phe 








125 










130 










135 










ttc 


gtg 


agt 


tec 


teg 


cag 


ggc 


ctg 


tea 


gag 


eta 


cac 


att 


gag 


aac 


ctg 


590 


pne 


val 


Ser 


Ser 


Ser 


Gin 


Gly 


Leu 


Ser 


Glu 


Leu 


His 


He 


Glu 


Asn 


Leu 






140 










145 










150 












aac 


atg 


gag 


gcc 


gac 


ccc 


ggc 


cag 


tac 


egg 


tgc 


aac 


ggc 


acc 


age 


tec 


638 


Asn 


Met 


Glu 


Ala 


Asp 


Pro 


Gly 


Gin 


Tyr 


Arg 


Cys 


Asn 


Gly 


Thr 


Ser 


Ser 




155 










160 










165 










170 




aag 


ggc 


tec 


gac 


cag 


gcc 


ate 


ate 


acg 


etc 


cgc 


gtg 


cgc 


age 


cac 


ctg 


686 


Lys 


Gly 


Ser 


Asp 


Gin 


Ala 


He 


He 


Thr 


Leu 


Arg 


Val 


Arg 


Ser 


His 


Leu 












175 










180 










185 






gcc 


gcc 


etc 


tgg 


ccc 


ttc 


ctg 


ggc 


ate 


gtg 


get 


gag 


gtg 


ctg 


gtg 


ctg 


734 


Ala 


Ala 


Leu 


Trp 


Pro 


Phe 


Leu 


Gly 


He 


Val 


Ala 


Glu 


Val 


Leu 


Val 


Leu 










190 










195 










200 








gtc 


acc 


ate 


ate 


ttc 


ate 


tac 


gag 


aag 


cgc 


egg 


aag 


ccc 


gag 


gac 


gtc 


782 


val 


Thr 


lie 


lie 


Phe 


He 


Tyr 


Glu 


Lys 


Arg 


Arg 


Lys 


Pro 


Glu 


Asp 


Val 








205 










210 










215 










ctg 


gat 


gat 


gac 


gac 


gcc 


ggc 


tct 


gca 


ccc 


ctg 


aag 


age 


age 


ggg 


cag 


830 


Leu 


Asp 


Asp 


Asp 


Asp 


Ala 


Gly 


Ser 


Ala 


Pro 


Leu 


Lys 


Ser 


Ser 


Gly 


Gin 






220 










225 










230 












cac 


cag 


aat 


gac 


aaa 


ggc 


aag 


aac 


gtc 


cgc 


cag 


agg 


aac 


tct 


tec 




875 


His 


Gin 


Asn 


Asp 


Lys 


Gly 


Lys 


Asn 


Val 


Arg 


Gin 


Arg 


Asn 


Ser 


Ser 







235 240 245 



tgaggcaggt ggcccgagga cgctccctgc tccgcgtctg cgccgecgcc ggagtccact 935 
cccagtgctt gcaagattcc aagttctcac ctcttaaaga aaacccaccc egtagattec 995 
catcatacac ttccttcttt tttaaaaaag ttgggttttc tccattcagg attctgttcc 1055 
ttaggatttt ttccttctga agtgtttcac gagagecegg gagctgetge cctgcggccc 1115 
cgtctgtggc tttcagcctc tgggtctgag teatggcegg gtgggcggca cagccttctc 1175 
cactggccgg agtcagtgcc aggtccttgc cctttgtgga aagtcacagg tcacacgagg 1235 
ggccccgtgt cctgcctgtc tgaagccaat gctgtctggt tgcgccattt ttgtgctttt 1295 
atgtttaatt ttatgagggc caegggtctg tgttcgactc agectcaggg acgactctga 1355 
cctcttggcc acagaggact cacttgccca caccgagggc gaccccgtca cagcctcaag 1415 
tcactcccaa gccccctcct tgtctgtgca teegggggea gctctggagg gggtttgctg 1475 
gggaactggc gccatcgccg ggactccaga acegcagaag cctccccagc tcacccctgg 1535 
aggaeggecg gctctctata gcaccagggc tcacgtggga acccccctcc cacccaccgc .1595 
cacaataaag atcgccccca cctccaccct caaaaaaaaa aaaaaaa 1642 

<210> 78 

<211> 269 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SIGNAL 
<222> 1. .21 



<400> 78 



Met Ala 


Ala 


Ala 


Leu 


Phe 


Val 


Leu 


Leu 


Gly 


Phe 


Ala 


Leu 


Leu 


Gly 


Thr 


-20 










-15 










-10 










His Gly 


Ala 


Ser 


Gly 


Ala 


Ala 


Gly 


Thr 


Val 


Phe 


Thr 


Thr 


Val 


Glu 


Asp 


-5 








1 








5 










10 


Leu Gly 


Ser 


Lys 


He 


Leu 


Leu 


Thr 


Cys 


Ser 


Leu 


Asn 


Asp 


Ser 


Ala 


Thr 






15 










20 








25 






Glu Val 


Thr 
30 


Gly 


His 


Arg 


Trp 


Leu 
35 


Lys 


Gly 


Gly 


Val 


Val 
40 


Leu 


Lys 


Glu 


Asp Ala 


Leu 


Pro 


Gly 


Gin 


Lys 


Thr 


Glu 


Phe 


Lys 


Val 


Asp 


Ser 


Asp 


Asp 


45 










50 










55 










Gin Trp 


Gly 


Glu 


Tyr 


Ser 


Cys 


Val 


Phe 


Leu 


Pro 


Glu 


Pro 


Met 


Gly 


Thr 


60 








65 










70 








75 


Ala Asn 


He 


Gin 


Leu 
80 


His 


Gly 


Pro 


Pro 


Arg 
85 


Val 


Lys 


Ala 


Val 


Lys 
90 


Ser 
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p ^ 

ser 


Glu 


His 


lie 


Asn Glu Gly Glu Thr Ala Met 


Leu Val Cys Lys Ser 








Q C 

yb 


i n ft 
100 


T ft C 

10b 




Ser 


vai 


Pro 


Pro Val Thr Asp Trp Ala Trp 


Tyr Lys He Thr Asp 






inn 
110 




lib 


i o ft 

120 


Ser 


CjlU 


ASp 


ays 


Ala Leu Met Asn Gly Ser Glu 


Ser Arg Phe Phe Val 




1-S O 






130 


13b 


Ser 


ber 


Ser 


tain 


Gly Leu Ser Glu Leu His He 


Glu Asn Leu Asn Met 


1 An 








tar ten 


lbb 


bill 


Aia 


ASp 


Pro 


Gly Gin Tyr Arg Cys Asn Gly 


Thr Ser Ser Lys Gly 










IbU lob 


170 


Ser* 


Asp 


Gin 


Ala 


lie lie Tnr Leu Arg vai Arg 


Ser His Leu Ala Ala 








175 


loO 


1 QC 

lob 


Leu 


Trp 


Pro 


Phe 


Leu Gly He Val Ala Glu Val 


Leu Val Leu Val Thr 






190 




195 


200 


lie 


lie 


Phe 


He 


Tyr Glu Lys Arg Arg Lys Pro 


Glu Asp Val Leu Asp 




205 






210 


215 


Asp 


Asp 


Asp Ala 


Gly Ser Ala Pro Leu Lys Ser 


Ser Gly Gin His Gin 


220 








225 230 


235 


Asn 


Asp 


Lys 


Gly 


Lys Asn Val Arg Gin Arg Asn 


Ser Ser 



240 245 



<210> 79 

<211> 1466 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> 5'UTR 
<222> 1. .343 

<220> 
<221> CDS 
<222> 344. .1144 

<220> 

<221> 3 'UTR 
<222> 1145. .1466 

<400> 79 

attgtgactt tgggecagge tgggggaaat gaecegggag ggtcccatgc ggctacataa 60 

aattggcagc cttagaacta gtgggaaggc gggtgcgcga agtcgagggg eggagagagg 120 

gggceggagg agetgettte tgaatccaag ttcgtgggct ctctcagaag tcctcaggac 180 

ggagcagagg tggceggegg gcccggctga ctgcgcctyt gctttctttc cataaccttt 240 

tettteggae tcgaatcacg getgetgega agggtctagt tccggacact agggtgcccg 300 

aacgegctga tgccccgagt getegcaggg cttcccgcta acc atg ctg ccg ccg 355 



























Met 


Leu 


Pro 


Pro 




ccg 


egg 


ccc 


gca 


get 


gee 


ttg 


gcg 


ctg 


cct 


gtg 


etc 


ctg 


eta 


ctg 


ctg 


403 


Pro 


Arg 


Pro 


Ala 


Ala 


Ala 


Leu 


Ala 


Leu 


Pro 


Val 


Leu 


Leu 


Leu 


Leu 


Leu 




-25 










-20 










-15 










-10 




gtg 


gtg 


ctg 


acg 


ccg 


ccc 


ccg 


acc 


ggc 


gca 


agg 


cca 


tec 


cca 


ggc 


cca 


451 


Val 


Val 


Leu 


Thr 


Pro 
-5 


Pro 


Pro 


Thr 


Gly 


Ala 
1 


Arg 


Pro 


Ser 


Pro 


Gly 


Pro 




gat 


tac 


ctg 


egg 


cgc 


ggc 


tgg 


atg 


egg 


ctg 


eta 


gcg 


gag 


5 

ggc 


gag 


ggc 


499 


Asp 


Tyr 


Leu 


Arg 


Arg 


Gly 


Trp 


Met 


Arg 


Leu 


Leu 


Ala 


Glu 


Gly 


Glu 


Gly 








10 










15 










20 










tgc 


get 


ccc 


tgc 


egg 


cca 


gaa 


gag 


tgc 


gec 


gcg 


ccg 


egg 


ggc 


tgc 


ctg 


547 


Cys 


Ala 


Pro 


Cys 


Arg 


Pro 


Glu 


Glu 


Cys 


Ala 


Ala 


Pro 


Arg 


Gly 


Cys 


Leu 






25 










30 










35 












gcg 


ggc 


agg 


gtg 


cgc 


gac 


gcg 


tgc 


ggc 


tgc 


tgc 


tgg 


gaa 


tgc 


gee 


aac 


595 


Ala 


Gly 


Arg 


Val 


Arg 


Asp 


Ala 


Cys 


Gly 


Cys 


Cys 


Trp 


Glu 


Cys 


Ala 


Asn 




40 










45 










50 










55 




etc 


9ag 


ggc 


cag 


etc 


tgc 


gac 


ctg 


gac 


ccc 


agt 


get 


cac 


ttc 


tac 


ggg 


643 


Leu 


Glu 


Gly 


Gin 


Leu 


Cys 


Asp 


Leu 


Asp 


Pro 


Ser 


Ala 


His 


Phe 


Tyr 


Gly 
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70 






cac 


tgc 


99C 


gag 


cag ctt 


gag tgc egg ctg gac 


aca 


ggc 


ggc gac 


ctg 


691 




Cys 


Pi tr 

vjj.y 


ulU 


VjXIl JJCU 


oxu uys Arg x»eu Asp 


Thr Gly Gly Asp 


Leu 










/ 3 




fin 






as 






age 


cgc 


99 a 


gag 


gi-g ccg 


gaa ecu ccg cgc gec 


tgt 


cgt 


teg cag 


agt 






Arg 




m ii 


vax IriO 


Li-Lu r* ro Jjeu uys A±a 

J? -J 




Arg 




Ser 




ccg 


etc 


tgc 


ggg 


tec gac 


ggt cac ace tac tec 


cag 


ate 


tgc cgc 


ctg 


787 




Leu 

inc 


Cys 


urxy 


oci M.Sp 


oiy iixB inr xyr oer 
inn 


Gin 
115 


He 


Cys Arg 


Leu 




cag 


9 a 9 


9 C 9 


gec 


C 9 C gec 


egg ccc gac gee aac 


etc 


act 


gtg gca 


cac 


835 


ran n 


til n 

VjXU 


Aid 


Til a 

ax a 


Arg Aia 


Arg irro Asp ax a Asn 


Leu 


Thr 


Val Ala 


TT 4 e* 

rilS 














Tin 








X J b 




ccg 


999 


CCC 


tgc 


gaa ccg 


rrnrrt ^"»a/*r 

999 ccc c ag ace g eg 


tea 


cat 


cca tat 


gac 


ooi 


Pro 




Pro 


Cys 


ljxu oer 

1 Aft 


oxy Fro vaxn xxe vax 

X4b 


Ser 


His 


Pro Tyr 
150 


Asp 




aCL 


tgg 


aau 


gtg 


aca ggg 


cag gat gtg ate ttt 


ggc 


tgt 


gaa gtg 


4- 4- +- 

CCC 


931 


xnr 


Trp 


Asn 


vai 


Thx Gly 


Cjxn Asp vax xxe pne 


Gly Cys Glu Val 


Pne 














T £ft 

loU 






165 






gec 


tac 


CCC 


atg 


gec tec 


ate gag tgg agg aag 


gat ggc 


ttg gac 


ate 


979 


Ala 


Tyr 


Pro 


Met 


Ala Ser 


He Glu Trp Arg Lys 


Asp Gly Leu Asp 


He 








x / u 






X Jo 




180 








cag 


ctg 


cca 


ggg 


gat gac 


ccc cac ate tct gtg 


cag 


ttt 


agg ggt 


gga 


1027 


VjrJLEL 


Leu 


Pro 


ojLy 


Asp Asp 


Pro Mis ixe ser vax 


Gin Phe Arg Gly 


Gly 














190 


195 










ccc 


cag 


agg 


ttt 


gag gtg 


act ggc tgg ctg cag 


ate 


cag get gtg 


cgt 


1075 


Pro 


Gin 


Arg 


Phe 


Glu Val 


Thr Gly Trp Leu Gin 


lie 


Gin Ala Val 


Arg 




200 








205 


210 








215 




ccc 


agt 


gat 


gag 


ggc act 


tac cgc tgc ctt ggc 


cca 


atg 


ccc tgg 


gtc 


1123 


Pro 


Ser 


Asp 


Glu 


Gly Thr 
220 


Tyr Arg Cys Leu Gly 
225 


Pro 


Met 


Pro Trp 
230 


Val 




aag 


tgg 


agg 


ccc 


ctg eta 


get tgacagtgct cacacctgac cagctgaact 


1174 


Lys 


Trp 


Arg 


Pro 
235 


Leu Leu 


Ala 













ctacaggcat cccccagctg cgatcactaa acctggttcc tgaggaggag gctgagagtg 1234 

aagagaatga cgattactac taggtccaga gctctggccc atgggggtgg gtgagegget 1294 

atagtgttca tccctgctct tgaaaagacc tggaaagggg agcagggtcc cttcatcgac 1354 

tgctttcatg ctgtcagtag ggatgatcat gggaggecta tttgactcca aggtagcagt 1414 

gtggtaggat agagacaaaa gctggaggag ggtagggaga gaagctgaga cc 1466 



<210> 80 
<211> 267 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> 1. .30 



<400> 80 



Met 


Leu 


Pro 


Pro 


Pro Arg 


Pro Ala Ala Ala Leu Ala 


Leu 


Pro 


Val Leu 


-30 








-25 


-20 






-15 


Leu 


Leu 


Leu 


Leu 


Val Val 


Leu Thr Pro Pro Pro Thr 


Gly 


Ala 


Arg Pro 










-10 


-5 






1 


Ser 


Pro 


Gly 


Pro 


Asp Tyr 


Leu Arg Arg Gly Trp Met 


Arg 


Leu 


Leu Ala 






5 






10 


15 






Glu 


Gly 


Glu 


Gly 


Cys Ala 


Pro Cys Arg Pro Glu Glu 


Cys 


Ala 


Ala Pro 




20 








25 30 








Arg 


Gly 


Cys 


Leu 


Ala Gly 


Arg Val Arg Asp Ala Cys 


Gly 


Cys 


Cys Trp 


35 








40 


45 






50 


Glu 


Cys 


Ala 


Asn 


Leu Glu 


Gly Gin Leu Cys Asp Leu 


Asp 


Pro 


Ser Ala 










55 


60 






65 


His 


Phe 


Tyr 


Gly 


His Cys 


Gly Glu Gin Leu Glu Cys 


Arg 


Leu 


Asp Thr 
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70 




75 










80 








oxy 


Asp 
85 


Leu 


Got* Atvt f2T i.r 

ocx >i x y oiy 


uXU VaJ. 

90 


Pro 


oXU 


Pro 


Leu 
95 


Cys 


Aia 


Cys 




Cat* 

100 


OXil 


OCX. 


riv lieu y o 

105 


d"\ \r Cor 
oxy OCX 


A an 
ns^ 


G 1 V 
oxy 


xix a 
110 


JL XIX 


Tyr 


Ser 


oxn 


He 


v*y s 


Am 




Gin Gin Ala 

UXU OX IX ZTLXCL 


A "1 n A r rr 
rvxct nJ.y 


Al a 






A an 


AT *-* 

Hid 


Asn 


Leu 


115 








120 






125 










130 


"»1XX 


v ai 


Ala 


His 


riu oiy nu 
135 


v»y& OX IX 


Cor 

OCX 

14 0 


Glv 
oxy 


xr x. \j 


OXxl 


Tl o 
xxc 


val 
x*± ■? 


Ser 


Hlq 
xixo 


irx o 


'TV** 


150 


J.X1X lip noil 


V a X J. XIX 

155 


oxy 


OX XI 




V ax 


Tl o 
xxc 

160 


xriie 


oxy 




ox u 


Val 
165 


Phe 


Si n *T*\rr Pro 


Mor* Al A 
lie L. rU. ct 

170 


Cor 

OCX 


Tl o 
xxc 


m ii 

ox u 


Trp 
175 


Arg 


Lys 


Asp 


wiy 


Leu 
180 


Asp 


Tl o 


oXn jjeu fro 
185 


oiy Asp 


Asp 


Pro 


TT -! „ 

iilS 
190 


Tl o 

xie 


Ser 


val 


bin 


Phe 


Arg 


Gly 


Gly 


Pro Gin Arg 


Phe Glu 


Val 


Thr 


Gly 


Trp 


Leu 


Gin 


He 


195 








200 






205 










210 


Gin 


Ala 


Val 


Arg 


Pro Ser Asp 
215 


Glu Gly 


Thr 
220 


Tyr 


Arg 


Cys 


Leu 


Gly 
225 


Pro 


Met 


Pro 


Trp 


Val 
230 


Lys Trp Arg 


Pro Leu 
235 


Leu 


Ala 













<210> 81 

<211> 1406 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> 5'UTR 
<222> 1. .26 

<220> 
<221> CDS 
<222> 27. .689 

<220> 

<221> 3'UTR 
<222> 690. .1406 

<220> 

<221> polyA_signal 
<222> 1302. .1307 

<220> 

<221> polyA_site 
<222> 1325. .1406 

<400> 81 

cccggaagtg cgcaggcgct . ggcaag atg gcg gga ggg gtg cgc ccg ctg egg 53 

Met Ala Gly Gly Val Arg Pro Leu Arg 
-30 -25 



ggc 


etc 


cgc 


gee 


ttg tgt 


cgc 


gtg 


ctg 


etc 


ttc 


ctt teg 


cag 


ttc 


tgc 


101 


Gly 


Leu 


Arg 


Ala 


Leu Cys 


Arg 


Val 


Leu 


Leu 


Phe 


Leu Ser 


Gin 


Phe 


Cys 








-20 








-15 








-10 










att 


ctg 


teg 


ggc 


ggt gaa 


agt 


act 


gaa 


ate 


cca 


cct tat 


gtg 


atg 


aag 


149 


He 


Leu 


Ser 


Gly 


Gly Glu 


Ser 


Thr 


Glu 


He 


Pro 


Pro Tyr 


Val 


Met 


Lys 






-5 








1 








5 








10 




tgt 


ccg 


age 


aat 


ggt ttg 


tgt 


age 


agg 


ctt 


cct 


gca gac 


tgt 


ata 


gac 


197 


Cys 


Pro 


Ser 


Asn 


Gly Leu 


Cys 


Ser 


Arg 


Leu 


Pro 


Ala Asp 


Cys 


He 


Asp 












15 








20 








25 






tgc 


aca 


aca 


aat 


ttc tec 


tgt 


ace 


tat 


ggg 


aag 


cct gtc 


act 


ttt 


gac 


245 


Cys 


Thr 


Thr 


Asn 


Phe Ser 


Cys 


Thr 


Tyr 


Gly 


Lys 


Pro Val 


Thr 


Phe 


Asp 










30 








35 








40 
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tgt gca gtg aaa cca tct gtt acc tgt gtt gat caa gac ttc aaa tec 
Cys Ala Val Lys Pro Ser Val Thr Cys Val Asp Gin Asp Phe Lys Ser 

45 50 55 

caa aag aac ttc ate att aac atg act tgc aga ttt tgc tgg cag ctt 
Gin Lys Asn Phe lie lie Asn Met Thr Cys Arg Phe Cys Trp Gin Leu 

60 65 70 

cct gaa aca gat tac gag tgt acc aac tec acc age tgc atg acg gtg 
Pro Glu Thr Asp Tyr Glu Cys Thr Asn Ser Thr Ser Cys Met Thr Val 
75 80 85 90 

tec tgt cct egg cag cgc tac cct gee aac tgc acg gtg egg gac cac 
Ser Cys Pro Arg Gin Arg Tyr Pro Ala Asn Cys Thr Val Arg Asp His 

95 100 105 

gtc cac tgc ttg ggt aac cgt act ttt ccc aaa atg eta tat tgc aat 
Val His Cys Leu Gly Asn Arg Thr Phe Pro Lys Met Leu Tyr Cys Asn 

110 115 120 

tgg act gga ggc tat aag tgg tct acg get ctg get eta age ate acc 
Trp Thr Gly Gly Tyr Lys Trp Ser Thr Ala Leu Ala Leu Ser lie Thr 

125 130 135 

etc ggt ggg ttt gga gca gac cgt ttc tac ctg ggc cag tgg egg gaa 
Leu Gly Gly Phe Gly Ala Asp Arg Phe Tyr Leu Gly Gin Trp Arg Glu 

140 145 150 

ggc etc ggc aag etc ttc age ttc ggt ggc ctg gga ata tgg acg ctg 
Gly Leu Gly Lys Leu Phe Ser Phe Gly Gly Leu Gly lie Trp Thr Leu 
155 160 165 170 

ata gac gtc ctg etc att gga gtt ggc tat gtt gga cca gca gat ggc 
He Asp Val Leu Leu He Gly Val Gly Tyr Val Gly Pro Ala Asp Gly 

175 180 185 

tct ttg tac att tagctgtggt gtgtgcttca gaaaggagca gggcttagaa 
Ser Leu Tyr He 
190 

aaagcccttt tgtcegtagg agttgatgtg gtgtgagtga tatatttcta tgtttttaat 
gtacagcatc tgtactttgt ttgecttgat aaaggtaaga taaatgaaac gctgaactat 
gctaatctgg aatttgtttt tatttgectg aaatatattt ttttctgtga aaaaattaaa 
aegtacttaa gecaggagaa tgaattatac agtgattgaa aatccattta attcctatga 
cttttgtttt gtattgecca agtcaaacta catcacttgt atctccagcc caaatgtagt 
ctgccttgaa aagtctttca gctgtgactg caggaagtgg gagtgttttt attgttagct 
aattgctgtg actgeaggaa gtgggagtgt ttctgttgtt ggctaattga agttattagg 
ctcagcttca gtcatgtgta agttttgcag tgtaatacat atgtagtctg gtctgtatat 
atgaaaattt gaattaaact gcagaatgtt tatgtctagt tatggtttaa attttcttag 
tagtatataa aaggtaagag tactgaaaaa ttaataaaat tgcaagttaa gaaataaaaa 
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 
taaaaaaaaa aaaaaat 



293 



341 



389 



437 



485 



533 



581 



629 



677 



729 



789 
849 
909 
969 
1029 
1089 
1149 
1209 
1269 
1329 
1389 
1406 



<210> 82 
<211> 221 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> 1. .32 



<400> 82 














Met Ala Gly 


Gly 


Val Arg Pro Leu Arg 


Gly 


Leu 


Arg Ala Leu Cys Arg 


-30 






-25 






-20 


Val Leu Leu 


Phe 


Leu 


Ser Gin Phe Cys 


He 


Leu 


Ser Gly Gly Glu Ser 


-15 






-10 






-5 


Thr Glu He 


Pro 


Pro 


Tyr Val Met Lys 


Cys 


Pro 


Ser Asn Gly Leu Cys 


1 




5 




10 




15 


Ser Arg Leu 


Pro 


Ala 


Asp Cys lie Asp 


Cys 


Thr 


Thr Asn Phe Ser Cys 




20 




25 






30 


Thr Tyr Gly 


Lys 


Pro 


Val Thr Phe Asp 


Cys 


Ala 


Val Lys Pro Ser Val 


35 






40 




45 



76 
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Thr 


Cys 
50 


Val 


Asp 


Gin Asp 


Phe Lys Ser 
55 


Gin 


Lys 


Asn 
60 


Phe 


He 


He 


Asn 


Met 


Thr 


Cys 


Arg 


Phe Cys 


Trp Gin Leu 


Pro 


Glu 


Thr 


Asp 


Tyr 


Glu 


Cys 


65 








70 






75 










80 


Tnr 


Asn 


Ser 


Thr 


Ser Cys 

Q C 

ob 


Met Tnr val 


Ser 

art 


Cys 


Pro 


Arg 


Gin 


Arg 


Tyr 


Pro 


Ala 


Asn 


Cys 
100 


Thr Val 


Arg Asp His 
105 


Val 


His 


cys 


Leu 


Gly 
110 


Asn 


Arg 


rnr 


Pne 


Pro 
115 


Lys 


Met Leu 


Tyr Cys Asn 
120 


Trp 


Tnr 


Cjiy 


C3±y 
125 


Tyr 


Lys 


Trp 


Ser 


Thr 
130 


Ala 


Leu 


Ala Leu 


Ser He Thr 
135 


Leu 


Gly 


Gly 
140 


Phe 


Gly 


Ala 


Asp 


Arg 


Phe 


Tyr 


Leu 


Gly Gin 


Trp Arg Glu 


Gly 


Leu 


Gly 


Lys 


Leu 


Phe 


Ser 


145 








150 






155 










160 


Phe 


Gly 


Gly 


Leu 


Gly He 
165 


Trp Thr Leu 


He 
170 


Asp 


Val 


Leu 


Leu 


He 
175 


Gly 


Val 


Gly 


Tyr 


Val 
180 


Gly Pro 


Ala Asp Gly 
185 


Ser 


Leu 


Tyr 


He 









<210> 83 

<211> 1754 

<212> DNA 

<213> Homo sapiens 
<220i 

<221> 5'UTR 

<222> 1. .117 

<220> 
<221> CDS 
<222> 118. .510 

<220> 

<221> 3 'UTR 
<222> 511. .1754 

<220> 

<221> polyA_signal 
<222> 1718. .1723 

<220> 

<221> polyA_site 
<222> 1739. .1754 

<400> 83 

tccccggccg ccgccgttgc gctcgccgcg ctcgcactga agcccgggcc ctcgcgcgcc 60 
gcggttcgcc ccgcagcctc gccccctgcc cacccgggcg gccgtagggc ggtcacg 117 



atg 


ctg 


ccg 


ccc 


tta 


ccc 


tec 


cgc 


etc 


ggg ctg ctg 


ctg 


ctg 


ctg 


etc 


165 


Met 


Leu 


Pro 


Pro 


Leu 


Pro 


Ser 


Arg 


Leu 


Gly Leu Leu 


Leu 


Leu 


Leu 


Leu 










-20 










-15 






-10 








ctg 


tgc 


ccg 


gcg 


cac 


gtc 


ggc 


gga 


ctg 


tgg tgg get 


gtg 


ggc 


age 


ccc 


213 


Leu 


Cys 


Pro 
-5 


Ala 


His 


Val 


Gly 


Gly 
1 


Leu 


Trp Trp Ala 
5 


Val 


Gly 


Ser 


Pro 




ttg 


gtt 


atg 


gac 


cct 


acc 


age 


ate 


tgc 


agg aag gca 


egg 


egg 


ctg 


gee 


261 


Leu 


Val 


Met 


Asp 


Pro 


Thr 


Ser 


He 


Cys 


Arg Lys Ala 


Arg 


Arg 


Leu 


Ala 




10 










15 








20 








25 




ggg 


egg 


cag 


gec 


gag 


ttg 


tgc 


cag 


get 


gag ccg gaa 


gtg 


gtg 


gca 


gag 


309 


Gly 


Arg 


Gin 


Ala 


Glu 


Leu 


Cys 


Gin 


Ala 


Glu Pro Glu 


Val 


Val 


Ala 


Glu 












30 










35 






40 






ctg 


get 


egg 


ggc 


gec 


egg 


etc 


ggg 


gtg 


cga gag tgc 


cag 


ttc 


cag 


ttc 


357 


Leu 


Ala 


Arg 


Gly 


Ala 


Arg 


Leu 


Gly 


Val 


Arg Glu Cys 


Gin 


Phe 


Gin 


Phe 










45 










50 






55 








cgc 


ttc 


cgc 


cgc 


tgg 


aat 


tgc 


tec 


age 


cac age aag 


gec 


ttt 


gga 


cgc 


405 



77 
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Arg Phe Arg Arg Trp Asn Cys Ser Ser His Ser Lys Ala Phe Gly Arg 

60 65 70 

ate ctg caa cag ggt cag tgt ggg gag ggg cac cct gca agg acc ctg 453 
lie Leu Gin Gin Gly Gin Cys Gly Glu Gly His Pro Ala Arg Thr Leu 

75 80 85 

cct ccc agg ccc ctg ggg cag ccc tec cgc cgc agg ttt cag gtc cca 501 
Pro Pro Arg Pro Leu Gly Gin Pro Ser Arg Arg Arg Phe Gin Val Pro 
90 95 100 105 

ggc ccc age tgaccgcccc agcccgcgct gattgeaect gtctgeatte 550 
Gly Pro Ser 

acagacattc gggagaegge cttcgtgttc gccatcactg cggccggcgc cagccacgcc 610 

gtcaegcagg cctgttctat gggcgagctg ctgcagtgcg gctgccaggc gccccgcggg 670 

cgggcccctc cccggccctc cggcctgccc ggcacccccg gaccccctgg ccccgcgggc 730 

tccccggaag gcagcgccgc ctgggagtgg ggaggctgcg gcgacgacgt ggacttcggg 790 

gacgagaagt cgaggctctt tatgsacgcg cggcacaagc ggggacgegg agacatccgc .850 

gcgttggtgc aactgeacaa caacgaggcg ggcaggctgg ccgtgcggag ccacacgcgc 910 

accgagtgea aatgecaegg gctgtcggga tcatgcgcgc tgcgcacctg ctggcagaag 970 

ctgcctccat ttcgcgaggt gggegegegg ctgctggagc gcttccacgg cgcctcacgc 1030 

gtcatgggca ccaacgacgg caaggccctg ctgcccgccg tccgcacgct caagccgccg 1090 

ggecgagegg acctcctcta cgccgccgat tcgcccgact tctgcgcccc caaccgacgc 1150 

accggctccc ccggcacgcg cggtcgcgcc tgeaatagea gcgccccgga cctcagcggc 1210 

tgcgacctgc tgtgctgcgg ccgcgggcac cgecaggaga gcgtgcagct cgaagagaac 1270 

tgcctgtgcc gcttccactg gtgctgcgta gtacagtgcc accgctgccg tgtgcgcaag 1330 

gagctcagcc tctgcctgtg acccgccgcc cggccgctag actgacttcg cgcagcggtg 1390 

gctcgcacct gtgggacctc agggcacegg caccgggcgc ctctcgccgc tcgagcccag 1450 

cctctccctg ccaaagccca actcccaggg ctctggaaat ggtgaggcga ggggcttgag 1510 

aggaacgccc acccacgaag gcccagggcg ccagacggcc ccgaaaaggc geteggggag 1570 

cgtttaaagg acactgtaca ggccctccct ccccttggcc tctaggagga aacagttttt 1630 

tagactggaa aaaagccagt etaaaggect ctggatactg ggctccccag aactgetgge 1690 

cacaggatgg tgggtgaggt tagtatcaat aaagatattt aaaccaccaa aaaaaaaaaa 1750 

aaaa 1754 



<210> 84 

<211> 131 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 
<222> 1..24 



<400> 84 






Met 


Leu 


Pro 


Pro 


Leu Pro Ser Arg 










-20 


Leu 


Cys 


Pro 


Ala 
-5 


His Val Gly Gly 


Leu 


Val 


Met 


Asp 


Pro Thr Ser lie 




10 






15 


Gly 


Arg 


Gin 


Ala 


Glu Leu Cys Gin 


25 








30 


Leu 


Ala 


Arg 


Gly 


Ala Arg Leu Gly 










45 


Arg 


Phe 


Arg 


Arg 


Trp Asn Cys Ser 








60 




lie 


Leu 


Gin 


Gin 


Gly Gin Cys Gly 






75 




80 


Pro 


Pro 


Arg 


Pro 


Leu Gly Gin Pro 




90 






95 


Gly 


Pro 


Ser 






105 











Leu Gly Leu 


Leu 


Leu 


Leu 


Leu 


Leu 


-15 








-10 




Leu Trp Trp 


Ala 


Val 


Gly 


Ser 


Pro 


1 




5 








Cys Arg Lys 


Ala 
20 


Arg 


Arg 


Leu 


Ala 


Ala Glu Pro 


Glu 


Val 


Val 


Ala 


Glu 


35 










40 


Val Arg Glu 


Cys 


Gin 


Phe 


Gin 


Phe 


50 








55 




Ser His Ser 


Lys 


Ala 


Phe 


Gly 


Arg 


65 






70 






Glu Gly His 


Pro 


Ala 
85 


Arg 


Thr 


Leu 


Ser Arg Arg 


Arg 
100 


Phe 


Gin 


Val 


Pro 



<210> 85 
<211> 1754 
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<212> DNA 

<213> Homo sapiens 

<220> 

<221> 5»UTR 
<222> 1. .117 

<220> 
<221> CDS 
<222> 118. .510 

<220> 

<221> 3 r UTR 
<222> 511. .1754 

<220> 

<221> polyA_signal 
<222> 1718. .1723 

<220> 

<221> polyA_site 
<222> 1739. .1754 

<400> 85 

tccccggccg ccgccgttgc gctcgccgcg ctcgcactga agcccgggcc ctcgcgcgcc 60 
gcggttcgcc ccgcagcctc gccccctgcc cacccgggcg gccgtagggc ggtcacg 117 



atg 


ctg 


ccg 


ccc 


tta 


ccc 


tec 


cgc 


etc 


ggg 


ctg 


ctg 


ctg 


ctg 


ctg 


etc 


165 


Met 


Leu 


Pro 


Pro 


Leu 


Pro 


Ser 


Arg 


Leu 


Gly 


Leu 


Leu 


Leu 


Leu 


Leu 


Leu 










-20 










-15 










-10 








ctg 


tgc 


ccg 


gcg 


cac 


gtc 


ggc 


gga 


ctg 


tgg 


tgg 


get 


gtg 


ggc 


age 


ccc 


213 


Leu 


Cys 


Pro 
-5 


Ala 


His 


Val 


Gly 


Gly 
1 


Leu 


Trp 


Trp 


Ala 
5 


val 


Gly 


Ser 


Pro 




ttg 


gtt 


atg 


gac 


cct 


ace 


age 


ate 


tgc 


agg 


aag 


gca 


egg 


egg 


ctg 


gec 


261 


Leu 


Val 


Met 


Asp 


Pro 


Thr 


Ser 


He 


Cys 


Arg 


Lys 


Ala 


Arg 


Arg 


Leu 


Ala 




10 










15 










20 










25 




ggg 


egg 


cag 


gec 


gag 


ttg 


tgc 


cag 


get 


gag 


ccg 


gaa 


gtg 


gtg 


gca 


gag 


309 


Gly 


Arg 


Gin 


Ala 


Glu 


Leu 


Cys 


Gin 


Ala 


Glu 


Pro 


Glu 


Val 


Val 


Ala 


Glu 












30 










35 










40 






ctg 


get 


egg 


ggc 


gee 


egg 


etc 


ggg 


gtg 


cga 


gag 


tgc 


cag 


ttc 


cag 


ttc 


357 


Leu 


Ala 


Arg 


Gly 


Ala 


Arg 


Leu 


Gly 


Val 


Arg 


Glu 


Cys 


Gin 


Phe 


Gin 


Phe 










45 










50 










55 








cgc 


ttc 


cgc 


cgc 


tgg 


aat 


tgc 


tec 


age 


cac 


age 


aag 


gec 


ttt 


gga 


cgc 


405 


Arg 


Phe 


Arg 


Arg 


Trp 


Asn 


Cys 


Ser 


Ser 


His 


Ser 


Lys 


Ala 


Phe 


Gly 


Arg 








60 










65 










70 










ate 


ctg 


caa 


cag 


ggt 


cag 


tgt 


ggg 


gag 


ggg 


cac 


cct 


gca 


agg 


ace 


ctg 


453 


He 


Leu 


Gin 


Gin 


Gly 


Gin 


Cys 


Gly 


Glu 


Gly 


His 


Pro 


Ala 


Arg 


Thr 


Leu 






75 










80 










85 












cct 


ccc 


agg 


ccc 


ctg 


ggg 


cag 


ccc 


tec 


cgc 


cgc 


agg 


ttt 


cag 


gtc 


cca 


501 


Pro 


Pro 


Arg 


Pro 


Leu 


Gly 


Gin 


Pro 


Ser 


Arg 


Arg 


Arg 


Phe 


Gin 


Val 


Pro 




90 










95 










100 










105 





ggc ccc age tgaccgcccc agcccgcgct gattgeaect gtctgeatte 550 
Gly Pro Ser 
acagacattc gggagaegge 
gtcaegcagg cctgttctat 
cgggcccctc cccggccctc 
tccccggaag gcagcgccgc 
gacgagaagt cgaggctctt 
gcgttggtgc aactgeacaa 
accgagtgea aatgecaegg 
ctgcctccat ttcgcgaggt 
gtcatgggca ccaacgacgg 
ggecgagegg acctcctcta 
accggctccc ccggcacgcg 

79 



cttcgtgttc gccatcactg cggccggcgc cagccacgcc 610 
gggcgagctg ctgcagtgcg gctgccaggc gccccgcggg 670 
cggcctgccc ggcacccccg gaccccctgg ccccgcgggc 730 
ctgggagtgg ggaggctgcg gcgacgacgt ggacttcggg 790 
tatggacgcg cggcacaagc ggggacgegg agacatccgc 850 
caacgaggcg ggcaggctgg ccgtgcggag ccacacgcgc 910 
gctgtcggga tcatgcgcgc tgcgcacctg ctggcagaag 970 
gggegegegg ctgctggagc gettycaegg cgcctcacgc 1030 
caaggccctg ctgcccgccg tccgcacgct caagccgccg 1090 
cgccgccgat tcgcccgact tctgcgcccc caaccgacgc 1150 
cggtcgcgcc tgeaatagea gcgccccgga cctcagcggc 1210 
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tgcgacctgc tgtgctgcgg 
tgcctgtgcc gcttccactg 
gagctcagcc tctgcctgtg 
gctcgcacct gtgggacctc 
cctctccctg ccaaagccca 
aggaacgccc acccacgaag 
cgtttaaagg acactgtaca 
tagactggaa aaaagccagt 
cacaggatgg tgggtgaggt 
aaaa 

<210> 86 

<211> 131 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SIGNAL 
<222> 1. .24 

<400> 86 



Met Leu 


Pro 


Pro 


Leu Pro Ser Arg Leu Gly Leu 


Leu 


Leu 


Leu 


Leu 


Leu 








-20 -15 








-10 




Leu Cys 


Pro 


Ala 
-5 


His Val Gly Gly Leu Trp Trp 


Ala 


Val 


Gly 


Ser 


Pro 


Leu Val 


Met 


Asp 


1 

Pro Thr Ser lie Cys Arg Lys 


Ala 


5 

Arg 


Arg 


Leu 


Ala 


10 






15 


20 










Gly Arg 


Gin 


Ala 


Glu Leu Cys Gin Ala Glu Pro 


Glu 


Val 


Val 


Ala 


Glu 


25 






30 35 










40 


Leu Ala 


Arg 


Gly 


Ala Arg Leu Gly Val Arg Glu 


Cys 


Gin 


Phe 


Gin 


Phe 








45 50 








55 




Arg Phe 


Arg 


Arg. 


Trp Asn Cys Ser Ser His Ser 


Lys 


Ala 


Phe 


Gly 


Arg 






60 


65 






70 






lie Leu 


Gin 


Gin 


Gly Gin Cys Gly Glu Gly His 


Pro 


Ala 


Arg 


Thr 


Leu 




75 




80 




85 








Pro Pro 


Arg 


Pro 


Leu Gly Gin Pro Ser Arg Arg 


Arg 


Phe 


Gin 


Val 


Pro 


90 






95 


100 











Gly Pro Ser 
105 

<210> 87 

<211> 1431 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> 5'UTR 
<222> 1. .151 

<220> 
<221> CDS 
<222> 152. .655 

<220> 

<221> 3'UTR 
<222> 656.. 1431 

<220> 

<22l> polyA_signal 
<222> 1399. .1404 

<220> 

<221> polyA_site 



ccgcgggcac cgccaggaga 
gtgctgcgta gtacagtgcc 
acccgccgcc cggccgctag 
agggcaccgg caccgggcgc 
actcccaggg ctctggaaat 
gcccagggcg ccagacggcc 
ggccctccct ccccttggcc 
ctaaaggcct ctggatactg 
tagtatcaat aaagatattt 



gcgtgcagct cgaagagaac 1270 

accgctgccg tgtgcgcaag 1330 

actgacttcg cgcagcggtg 1390 

ctctcgccgc tcgagcccag 1450 

ggtgaggcga ggggcttgag 1510 

ccgaaaaggc gctcggggag 1570 

tctaggagga aacagttttt 1630 

ggctccccag aactgctggc 1690 

aaaccaccaa aaaaaaaaaa 1750 

1754. 



80 
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<222> 1416. .1431 
<400> 87 

aattttttct cacaaggact gggtgaagag ttctgcagcc ttacagagac tggaaaagaa 60 
gcccaaacca aggcccccag agaggtcccc caggcccctt tgggtccctg agcctcagct 120 
ggagatccgg cgcaggagac caacgcctgc c atg ctg ttc egg etc tea gag 172 

Met Leu Phe Arg Leu Ser Glu 

1 5 



cac 


tec 


tea 


cca 


gag 


gag 


gaa 


gee 


tec 


ccc 


cac 


cag 


aga gee 


4- ,_, — 


gga 


U 


His 


Ser 


Ser 
10 


Pro 


Glu 


Glu 


Glu 


Ala 
15 


Ser 


Pro 


His 


Gin 


Arg Ala 
20 


Ser 


Gly 




9ag 


999 


cac 


cat 


etc 


aag 


teg 


aag 


aga 


ccc 


aac 


ccc 


tgt gee 


tac 


aca 


268 


Glu 


Gly 
25 


His 


His 


Leu 


Lys 


Ser 
30 


Lys 


Arg 


Pro 


Asn 


Pro 
35 


Cys Ala 


Tyr 


Thr 




cca 


cct 


teg 


ctg 


aaa 


get 


gtg 


cag 


cgc 


att 


get 


gag 


tct cac 


ctg 


cag 


316 


Pro 


Pro 


Ser 


Leu 


Lys 


Ala 


Val 


Gin 


Arg 


lie 


Ala 


Glu 


Ser His 


Leu 


Gin 




40 










45 










50 








55 




tct 


ate 


age 


aat 


ttg 


aat 


gag 


aac 


cag 


gee 


tea 


gag 


gag gag 


gat 


gag 


364 


Ser 


lie 


Ser 


Asn 


Leu 
60 


Asn 


Glu 


Asn 


Gin 


Ala 
65 


Ser 


Glu 


Glu. Glu 


Asp 
70 


Glu 




ctg 


ggg 


gag 


ctt 


egg 


gag 


ctg 


ggt 


tat 


cca 


aga 


gag 


gaa gat 


gag 


gag 


412 


Leu 


Gly 


Glu 


Leu 
75 


Arg 


Glu 


Leu 


Gly 


Tyr 
80 


Pro 


Arg 


Glu 


Glu Asp 
85 


Glu 


Glu 




gaa 


gag 


gag 


gat 


gat 


gaa 


gaa 


gag 


gaa 


gaa 


gaa 


gag 


gac age 


cag 


get 


460 


Glu 


Glu 


Glu 
90 


Asp 


Asp 


Glu 


Glu 


Glu 
95 


Glu 


Glu 


Glu 


Glu 


Asp Ser 
100 


Gin 


Ala 




gaa 


gtc 


ctg 


aag 


gtc 


ate 


agg 


cag 


tct 


get 


ggg 


caa 


aag aca 


ace 


tgt 


508 


Glu 


Val 
105 


Leu 


Lys 


Val 


lie 


Arg 
110 


Gin 


Ser 


Ala 


Gly 


Gin 
115 


Lys Thr 


Thr 


Cys 




ggc 


cag 


ggt 


ctg 


gaa 


ggg 


ccc 


tgg 


gag 


cgc 


cca 


ccc 


cct ctg 


gat 


gag 


556 


Gly 


Gin 


Gly 


Leu 


Glu 


Gly 


Pro 


Trp 


Glu 


Arg 


Pro 


Pro 


Pro Leu 


Asp 


Glu 




12 0 










125 










130 








135 




tec 


gag 


aga 


gat 


gga 


ggc 


tct 


gag 


gac 


caa 


gtg 


gaa 


gac cca 


gca 


eta 


604 


Ser 


Glu 


Arg 


Asp 


Gly 
140 


Gly 


Ser 


Glu 


Asp 


Gin 
145 


Val 


Glu 


Asp Pro 


Ala 
150 


Leu 




agt 


gag 


cct 


ggg 


gag 


gaa 


cct 


cag 


cgc 


cct 


tec 


ccc 


tct gag 


cct 


ggc 


652 


Ser 


Glu 


Pro 


Gly 
155 


Glu 


Glu 


Pro 


Gin 


Arg 
160 


Pro 


Ser 


Pro 


Ser Glu 
165 


Pro 


Gly 





aca taggcaccca gcctgcatct cccaggagga agtggagggg acategctgt 705 
Thr 



tccccagaaa cccactctat cctcaccctg ttttgtgctc ttcccctcgc ctgetaggge 765 
tgeggcttet gacttctaga agactaaggc tggtctgtgt ttgcttgttt gcccaccttt 825 
ggctgatacc cagagaacct gggcacttgc tgcctgatgc ccacccctgc cagtcattcc 885 
tccattcacc cagegggagg tgggatgtga gacagcccac attggaaaat ccagaaaacc 945 
gggaacaggg atttgecett cacaattcta ctccccagat cctctcccct ggacacagga 1005 
gacccacagg gcaggaccct aagatctggg gaaaggaggt cctgagaacc ttgaggtacc 1065 
cttagatcct tttctaccca ctttcctatg gaggattcca agtcaccact tctctcaccg 1125 
gcttctacca gggtccagga etaaggegtt tttctccata gcctcaacat tttgggaatc 1185 
ttcccttaat cacccttgct cctcctgggt gectggaaga tggactggca gagacctctt 1245 
tgttgcgttt tgtgctttga tgccaggaat gccgcctagt ttatgtcccc ggtggggcac 1305 
acageggggg gcgccaggtt ttccttgtcc cccagctgct ctgccccttt ccccttcttc 1365 
cctgactcca ggcctgaacc cctcccgtgc tgtaataaat ctttgtaaag aaaaaaaaaa 1425 
aaaaaa 1431 

<210> 88 

<211> 168 

<212> PRT 

<213> Homo sapiens 

<400> 88 

Met Leu Phe Arg Leu Ser Glu His Ser Ser Pro Glu Glu Glu Ala Ser 

15 10 15 

pro His Gin Arg Ala Ser Gly Glu Gly His His Leu Lys Ser Lys Arg 
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Pro 


Asn 


Pro 


cys Aia 


Tyr 


inr 


Pro 


Pro 


O A V 

ber 


Leu Lys 


Ala 


val 


Gin 


Arg 






«5D 








*i u 




















pi n 
17J.U 


ber xiis 


Leu 




Ser 


lie 


Ser 


Asn Leu 


Asn 


UlU 


Asn 


Gin 












55 








tin 










AT a 


Cor 


m n 

blU 


r2in rain 


ASp 


nl n 


Leu 


vixy 


pi ii 


Lieu Arg 


r*l ii 
LxlU 


Leu 


HI ir 


Tyr 


65 








70 


















Q O 
o U 




Arg 


uiU 


ulU J\Sp 


vjIU 


rai ii 


f21 ii 

VjJLU 


r»i ii 


r»1 ii 
uXU 


Asp Asp 


bill 


G1U 


bill 


Glu 


























QC 




m ii 

uiu 


La ±11 






r*i n 




ulU 


vai 


Leu 


ijys val 


lie 


Arg 


Gin 


Ser 








100 








X VJ -3 








iin 

XiU 






Ala 


Gly 


Gin 


Lys Thr 


Thr 


Cys 


Gly 


Gin 


Gly 


Leu Glu 


Gly 


Pro 


Trp 


Glu 






115 








120 








125 








Arg 


Pro 


Pro 


Pro Leu 


Asp 


Glu 


Ser 


Glu 


Arg 


Asp Gly 


Gly 


Ser 


Glu 


Asp 




130 








135 








140 










Gin 


Val 


Glu 


Asp Pro 


Ala 


Leu 


Ser 


Glu 


Pro 


Gly Glu 


Glu 


Pro 


Gin 


Arg 


145 








150 










155 








160 


Pro 


Ser 


Pro 


Ser Glu 


Pro 


Gly 


Thr 

















165 



<210> 89 
<211> 1431 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> 5'UTR 
<222> 1. .151 

<220> 
<221> CDS 
<222> 152. .655 

<220> 

<221> 3'UTR 
<222> 656. .1431 

<220> 

<221> polyA_signal 
<222> 1399. .1404 

<220> 

<221> polyA_site 
<222> 1416.. 1431 

<400> 89 

aattttttct cacaaggact gggtgaagag ttctgcagcc ttacagagac tggaaaagaa 60 
gcccaaacca aggcccccag agaggtcccc caggcccctt tgggtccctg agcctcagct 120 
ggagatccgg cgcaggagac caacgcctgc c atg ctg ttc egg etc tea gag 172 

Met Leu Phe Arg Leu Ser Glu 

1 5 



cac 


tec tea 


cca 


gag 


gag 


gaa 


gee 


tec 


ccc 


cac 


cag 


aga 


gee 


tea 


gga 


220 


His 


Ser Ser 
10 


Pro 


Glu 


Glu 


Glu 


Ala 
15 


Ser 


Pro 


His 


Gin 


Arg 
20 


Ala 


Ser 


Gly 




gag 


ggg cac 


cat 


etc 


aag 


teg 


aag 


aga 


ccc 


aac 


ccc 


tgt 


gee 


tac 


aca 


268 


Glu Gly His 


His 


Leu 


Lys 


Ser 


Lys 


Arg 


Pro 


Asn 


Pro 


Cys 


Ala 


Tyr 


Thr 






25 








30 










35 












cca 


cct teg 


ctg 


aaa 


get 


gtg 


cag 


cgc 


att 


get 


gag 


tct 


cac 


ctg 


cag 


316 


Pro 


Pro Ser 


Leu 


Lys 


Ala 


Val 


Gin 


Arg 


lie 


Ala 


Glu 


Ser 


His 


Leu 


Gin 




40 








45 










50 










55 




tct 


ate age 


aat 


ttg 


aat 


gag 


aac 


cag 


gec 


tea 


gag 


gag 


gag 


gat 


gag 


364 


Ser 


He Ser 


Asn 


Leu 
60 


Asn 


Glu 


Asn 


Gin 


Ala 
65 


Ser 


Glu 


Glu 


Glu 


Asp 
70 


Glu 
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ctg 


ggg 


gag 


ctt 


egg 


gag 


ctg 


ggt 


tat 


cca 


aga 


gag 


gaa 


gat 


gag 


gag 


412 


Leu 


Gly 


Glu 


Leu 


Arg 


Glu 


Leu 


Gly 


Tyr 


Pro 


Arg 


Glu 


Glu 


Asp 


Glu 


Glu 










75 










80 










85 








gaa 


gag 


gag 


gat 


gat 


gaa 


gaa 


gag 


gaa 


gaa 


gaa 


gag 


gac 


age 


cag 


get 


460 


Glu 


Glu 


Glu 


Asp 


Asp 


Glu 


Glu 


Glu 


Glu 


Glu 


Glu 


Glu 


Asp 


Ser 


Gin 


Ala 








90 










95 










100 










gaa 


gtc 


ctg 


aag 


gtc 


ate 


agg 


cag 


tct 


get 


ggg 


caa 


aag 


aca 


ace 


tgt 


508 


Glu 


Val 


Leu 


Lys 


Val 


lie 


Arg 


Gin 


Ser 


Ala 


Gly 


Gin 


Lys 


Thr 


Thr 


Cys 






105 










110 










115 












ggc 


cag 


ggt 


ctg 


gaa 


ggg 


ccc 


tgg 


gag 


cgc 


cca 


ccc 


cct 


ctg 


gat 


gag 


556 


Gly 


Gin 


Gly 


Leu 


Glu 


Gly 


Pro 


Trp 


Glu 


Arg 


Pro 


Pro 


Pro 


Leu 


Asp 


Glu 




120 










125 










130 










135 




tec 


gag 


aga 


gat 


gga 


ggc 


tct 


gag 


gac 


caa 


gtg 


gaa 


gac 


cca 


gca 


eta 


604 


Ser 


Glu 


Arg 


Asp 


Gly 


Gly 


Ser 


Glu 


Asp 


Gin 


Val 


Glu 


Asp 


Pro 


Ala 


Leu 












140 










145 










150 






agt 


gag 


cct 


ggg 


gag 


gaa 


cct 


cag 


cgc 


cct 


tec 


ccc 


tct 


gag 


cct 


ggc 


652 


Ser 


Glu 


Pro 


Gly 


Glu 


Glu 


Pro 


Gin 


Arg 


Pro 


Ser 


Pro 


Ser 


Glu 


Pro 


Gly 





155 160 165 



aca taggcaccca gcctgcatct cccaggagga agtggagggg acategctgt 705 
Thr 

tccccagaaa cccactctat cctcaccctg ttttgtgctc ttcccctcgc ctgetaggge 765 
tgeggcttet gacttctaga agactaaggc tggtctgtgt ttgcttgttt gcccaccttt 825 
ggctgatacc cagagaacct gggcacttgc tgcctgatgc ccacccctgc cagtcattcc 885 
tccattcacc cagegggagg tgggatgtga gacagcccac attggaaaat ccagaaaacc 945 
gggaacaggg atttgecett cacaattcta ctccccagat cctctcccct ggacacagga 1005 
gacccacagg gcaggaccct aagatctggg gaaaggaggt cctgagaacc ttgaggtacc 1065 
cttagatcct tttctaccca ctttcctatg gaggattcca agtcaccact tctctcaccg 1125 
gcttctacca gggtccagga etaaggegtt tttctccata gcctcaacat tttgggaatc 1185 
ttcccttaat cacccttgct cctcctgggt gectggaaga tggactggca gagacctctt 1245 
tgttgcgttt tgtgctttga tgccaggaat gccgcctagt ttatgtcccc ggtggggcac 1305 
acageggggg gcgccaggtt ttccttgtcc cccagctgct ctgccccttt ccccttcttc 1365 
cctgactcca ggcctgaacc cctcccgtgc tgtaataaat ctttgtaaag aaaaaaaaaa 1425 
aaaaaa 1431 

<210> 90 

<211> 168 

<212> PRT 

<213> Homo sapiens 



<400> 90 



Met 


Leu 


Phe 


Arg 


Leu 


Ser 


Glu 


His 


Ser 


Ser 


Pro 


Glu 


Glu 


Glu 


Ala 


Ser 


1 








5 










10 










15 




Pro 


His 


Gin 


Arg 
20 


Ala 


Ser 


Gly 


Glu 


Gly 
25 


His 


His 


Leu 


Lys 


Ser 
30 


Lys 


Arg 


Pro 


Asn 


Pro 
35 


Cys 


Ala 


Tyr 


Thr 


Pro 
40 


Pro 


Ser 


Leu 


Lys 


Ala 
45 


Val 


Gin 


Arg 


lie 


Ala 
50 


Glu 


Ser 


His 


Leu 


Gin 
55 


Ser 


He 


Ser 


Asn 


Leu 
60 


Asn 


Glu 


Asn 


Gin 


Ala 


Ser 


Glu 


Glu 


Glu 


Asp 


Glu 


Leu 


Gly 


Glu 


Leu 


Arg 


Glu 


Leu 


Gly 


Tyr 


65 










70 


i 








75 










80 


Pro 


Arg 


Glu 


Glu 


Asp 
85 


Glu 


Glu 


Glu 


Glu 


Glu 
90 


Asp 


Asp 


Glu 


Glu 


Glu 
95 


Glu 


Glu 


Glu 


Glu 


Asp 
100 


Ser 


Gin 


Ala 


Glu 


Val 
105 


Leu 


Lys 


Val 


He 


Arg 
110 


Gin 


Ser 


Ala 


Gly 


Gin 
115 


Lys 


Thr 


Thr 


Cys 


Gly 
120 


Gin 


Gly 


Leu 


Glu 


Gly 
125 


Pro 


Trp 


Glu 


Arg 


Pro 
130 


Pro 


Pro 


Leu 


Asp 


Glu 
135 


Ser 


Glu 


Arg 


Asp 


Gly 
140 


Gly 


Ser 


Glu 


Asp 


Gin 


Val 


Glu 


Asp 


Pro 


Ala 


Leu 


Ser 


Glu 


Pro 


Gly 


Glu 


Glu 


Pro 


Gin 


Arg 


145 










150 










155 










160 


Pro 


Ser 


Pro 


Ser 


Glu 


Pro 


Gly 


Thr 



















165 
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<210> 91 

<211> 1417 

<212> DNA 

<213> Homo sapiens 
<220> 

<221> 5 ! UTR 

<222> 1..47 

<220> 
<221> CDS 
<222> 48. .1301 

<220> 

<221> 3'UTR 
<222> 1302. .1417 

<220> 

<221> polyA_signal 
<222> 1360. .1365 

<220> 

<221> polyA_site 
<222> 1402. .1417 



<400> 91 

ctcctcagct tcaggcacca ccactgacct gggacagtga atcgaca atg ccg tct 56 

Met Pro Ser 



tct 


gtc 


teg 


tgg 


ggc 


ate 


etc 


ctg 


ctg 


gca 


ggc ctg 


tgc 


tgc ctg 


gtc 


104 


Ser 


val 


Ser 


Trp 


Gly 


He 


Leu 


Leu 


Leu 


Ala 


Gly Leu 


Cys 


Cys Leu 


Val 




-20 










-15 










-10 






-5 




cct 


gtc 


tec 


ctg 


get 


gag 


gat 


ccc 


cag 


gga 


gat get 


gcc 


cag aag 


aca 


152 


Pro 


Val 


Ser 


Leu 


Ala 

1 


Glu 


Asp 


Pro 


Gin 
5 


Gly 


Asp Ala 


Ala 


Gin Lys 
10 


Thr 




gat 


aca 


tec 


cac 


cat 


gat 


cag 


gat 


cac 


cca 


acc ttc 


aac 


aag ate 


acc 


200 


Asp 


Thr 


Ser 


His 


His 


Asp 


Gin 


Asp 


His 


Pro 


Thr Phe 


Asn 


Lys He 


Thr 








15 










20 








25 








ccc 


aac 


ctg 


get 


gag 


ttc 


gcc 


ttc 


age 


eta 


tac cgc 


cag 


ctg gca 


cac 


248 


Pro 


Asn 


Leu 


Ala 


Glu 


Phe 


Ala 


Phe 


Ser 


Leu 


Tyr Arg 


Gin 


Leu Ala 


His 






30 










35 








40 










cag 


tec 


aac 


age 


ace 


aat 


ate 


ttc 


ttc 


tec 


cca gtg 


age 


ate get 


aca 


296 


Gin 


Ser 


Asn 


Ser 


Thr 


Asn 


He 


Phe 


Phe 


Ser 


Pro Val 


Ser 


He Ala 


Thr 




45 










50 










55 






60 




gcc 


ttt 


gca 


atg 


etc 


tec 


ctg 


ggg 


acc 


aag 


get gac 


act 


cac gat 


gaa 


344 


Ala 


Phe 


Ala 


Met 


Leu 


Ser 


Leu 


Gly 


Thr 


Lys 


Ala Asp 


Thr 


His Asp 


Glu 












65 










70 






75 






ate 


ctg 


gag 


age 


ctg 


aat 


ttc 


aac 


etc 


acg 


gag att 


ccg 


gag get 


cag 


392 


He 


Leu 


Glu 


Ser 


Leu 


Asn 


Phe 


Asn 


Leu 


Thr 


Glu lie 


Pro 


Glu Ala 


Gin 










80 










85 








90 






ate 


cat 


gaa 


ggc 


ttc 


cag 


gaa 


etc 


etc 


cgt 


acc etc 


aac 


cag cca 


gac 


440 


He 


His 


Glu 


Gly 


Phe 


Gin 


Glu 


Leu 


Leu 


Arg 


Thr Leu 


Asn 


Gin Pro 


Asp 








95 










100 








105 








age 


cag 


etc 


cag 


ctg 


acc 


acc 


ggc 


aat 


ggc 


ctg ttc 


etc 


age gag 


ggc 


488 


Ser 


Gin 


Leu 


Gin 


Leu 


Thr 


Thr 


Gly 


Asn 


Gly 


Leu Phe 


Leu 


Ser Glu 


Gly 






110 










115 






120 










ctg 


aag 


eta 


gtg 


gat 


aag 


ttt 


ttg 


gag 


gat 


gtt aaa 


aag 


ttg tac 


cac 


536 


Leu 


Lys 


Leu 


Val 


Asp 


Lys 


Phe 


Leu 


Glu 


Asp 


Val Lys 


Lys 


Leu Tyr 


His 




125 










130 










135 






140 




tea 


gaa 


gcc 


ttc 


act 


gtc 


aac 


ttc 


ggg 


gac 


acc gaa 


gag 


gcc aag 


aaa 


584 


Ser 


Glu 


Ala 


Phe 


Thr 


Val 


Asn 


Phe 


Gly 


Asp 


Thr Glu 


Glu 


Ala Lys 


Lys 












145 










150 






155 






cag 


ate 


aac 


gat 


tac 


gtg 


gag 


aag 


ggt 


act 


caa ggg 


aaa 


att gtg 


gat 


632 


Gin 


He 


Asn 


Asp 


Tyr 


Val 


Glu 


Lys 


Gly 


Thr 


Gin Gly 


Lys 


He Val 


Asp 
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160 165 170 



ttg 


gtc 


aag 


gag 


ctt 


gac 


aga 


gac 


aca 


gtt 


ttt 


get 


ctg 


gtg 


aat 


tac 


680 


Leu 


Val 


Lys 
175 


Glu 


Leu 


Asp 


Arg 


Asp 
180 


Thr 


Val 


Phe 


Ala 


Leu 
185 


Val 


Asn 


Tyr 




ate 


ttc 


ttt 


aaa 


ggc 


aaa 


tgg 


gag 


aga 


ccc 


ttt 


gaa 


gtc 


aag 


gac 


acc 


728 


lie 


Phe 
190 


Phe 


Lys 


Gly 


Lys 


Trp 
195 


Glu 


Arg 


Pro 


Phe 


Glu 
200 


Val 


Lys 


Asp 


Thr 




9ag 


gaa 


gag 


gac 


ttc 


cac 


gtg 


gac 


cag 


gcg 


acc 


acc 


gtg 


aag 


gtg 


cct 


776 


Glu 


Glu 


Glu 


Asp 


Phe 


His 


Val 


Asp 


Gin 


Ala 


Thr 


Thr 


Val 


Lys 


Val 


Pro 




205 










210 










215 










220 




atg 


atg 


aag 


cgt 


tta 


ggc 


atg 


ttt 


aac 


ate 


cag 


cac 


tgt 


aag 


aag 


ctg 


824 


Met 


Met 


Lys 


Arg 


Leu 
225 


Gly 


Met 


Phe 


Asn 


He 
230 


Gin 


His 


Cys 


Lys 


Lys 
235 


Leu 




tec 


age 


tgg 


gtg 


ctg 


ctg 


atg 


aaa 


tac 


ctg 


ggc 


aat 


gee 


acc 


gec 


ate 


872 


Ser 


Ser 


Trp 


Val 
240 


Leu 


Leu 


Met 


Lys 


Tyr 
245 


Leu 


Gly 


Asn 


Ala 


Thr 
250 


Ala 


lie 




ttc 


ttc 


ctg 


cct 


gat 


gag 


ggg 


aaa 


eta 


cag 


cac 


ctg 


gaa 


aat 


gaa 


etc 


920 


Phe 


Phe 


Leu 
255 


Pro 


Asp 


Glu 


Gly 


Lys 
260 


Leu 


Gin 


His 


Leu 


Glu 
265 


Asn 


Glu 


Leu 




acc 


cac 


gat 


ate 


ate 


acc 


aag 


ttc 


ctg 


gaa 


aat 


gaa 


gac 


aga 


agg 


tct 


968 


Thr 


His 
270 


Asp 


He 


He 


Thr 


Lys 
275 


Phe 


Leu 


Glu 


Asn 


Glu 
280 


Asp 


Arg 


Arg 


Ser 




gec 


age 


tta 


cat 


tta 


ccc 


aaa 


ctg 


tec 


att 


act 


gga 


acc 


tat 


gat 


ctg 


1016 


Ala 


Ser 


Leu 


His 


Leu 


Pro 


Lys 


Leu 


Ser 


He 


Thr 


Gly 


Thr 


Tyr 


Asp 


Leu 




285 










290 










295 










300 




aag 


age 


gtc 


ctg 


ggt 


caa 


ctg 


ggc 


ate 


act 


aag 


gtc 


ttc 


age 


aat 


ggg 


1064 


Lys 


Ser 


Val 


Leu 


Gly 
305 


Gin 


Leu 


Gly 


He 


Thr 
310 


Lys 


Val 


Phe 


Ser 


Asn 
315 


Gly 




get 


gac 


etc 


tec 


ggg 


gtc 


aca 


gag 


gag 


gca 


ccc 


ctg 


aag 


etc 


tec 


aag 


1112 


Ala 


Asp 


Leu 


Ser 
320 


Gly 


Val 


Thr 


Glu 


Glu 
325 


Ala 


Pro 


Leu 


Lys 


Leu 
330 


Ser 


Lys 




gec 


gtg 


cat 


aag 


get 


gtg 


ctg 


acc 


ate 


gac 


gag 


aaa 


ggg 


act 


gaa 


get 


1160 


Ala 


Val 


His 
335 


Lys 


Ala 


Val 


Leu 


Thr 
340 


He 


Asp 


Glu 


Lys 


Gly 
345 


Thr 


Glu 


Ala 




get 


ggg 


gee 


atg 


ttt 


tta 


gag 


gee 


at a 


ccc 


atg 


tct 


ate 


ccc 


ccc 


gag 


1208 


Ala 


Gly 
350 


Ala 


Met 


Phe 


Leu 


Glu 
355 


Ala 


He 


Pro 


Met 


Ser 
360 


He 


Pro 


Pro 


Glu 




gtc 


aag 


ttc 


aac 


aaa 


ccc 


ttt 


gtc 


ttc 


tta 


atg 


att 


gaa 


caa 


aat 


acc 


1256 


val 


Lys 


Phe 


Asn 


Lys 


Pro 


Phe 


Val 


Phe 


Leu 


Met 


He 


Glu 


Gin 


Asn 


Thr 




365 










370 










375 










380 




aag 


tct 


ccc 


etc 


ttc 


atg 


gga 


aaa 


gtg 


gtg 


aat 


ccc 


acc 


caa 


aaa 




1301 


Lys 


Ser 


Pro 


Leu 


Phe 
385 


Met 


Gly 


Lys 


Val 


Val 
390 


Asn 


Pro 


Thr 


Gin 


Lys 
395 







taactgcctc tcgctcctca acccctcccc tccatccctg gccccctccc tggatgacat 1361 
taaagaaggg ttgagctggt ccctgcctgc atgtgactgc aaaaaaaaaa aaaaaa 1417 



<210> 92 
<211> 418 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> 1. .24 

<400> 92 
Met Pro Ser Ser 

Cys Leu Val Pro 
-5 

Gin Lys Thr Asp 
10 

Lys He Thr Pro 



Val Ser Trp Gly lie Leu Leu Leu Ala Gly Leu Cys 
-20 -15 " -10 

Val Ser Leu Ala Glu Asp Pro Gin Gly Asp Ala Ala 

1 5 
Thr Ser His His Asp Gin Asp His Pro Thr Phe Asn 

15 20 
Asn Leu Ala Glu Phe Ala Phe Ser Leu Tyr Arg Gin 
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*? c 
^5 






30 




Leu Ala 


HIS 


Gin 


Ser Asn 


Ser Thr 








A C 

45 




Tin J1- 

xie Ala 


Thr 


Ala 


Fne Aia 


Met Leu 












".IS ASp 


VjIU 


lie 


Leu Glu 


Ser Leu 




n c 
/5 






q n 


«XU Ala 


vjin 


lie 


HIS bill 


Caiy Fne 












uin Fro 


7V 

ASp 


Ser 


Cjin lieu 


Gin lieu 


-uu d 






11 A 

110 




Ser Glu 


Caly 


T ««« 

Leu 


Lys Leu 


val Asp 








IOC 




Leu Tyr 


xllS 


ser 


Glu Ala 


Phe Thr 






I/in 

14 U 






Aia Lys 


iiys 


Cain 


Tl - 71 MM 

lie Asn 


Asp Tyr 




ICC 

15b 






160 


lie vai 


Asp 


Leu 


Val Lys 


Glu Leu 


J. /U 








175 


Val Asn 


Tyr 


lie 


Phe Phe 


Lys Gly 


IOC 

155 






190 




Lys Asp 


Thr 


Glu 


Glu Glu 


Asp Phe 








205 




Lys val 


Pro 


Met 


Met Lys 


Arg Leu 






220 






Lys Lys 


Leu 


Ser 


Ser Trp 


Val Leu 




"5 "3 C 

A .35 






240 


Tnr Ala 


lie 


Pne 


Phe Leu 


Pro Asp 










o c c 
Z55 


Asn Glu 


Leu 


Thr 


His Asp 


He He 


1 c c 






270 




Arg Arg 


Ser 


Ala 


Ser Leu 


His Leu 








285 




Tyr Asp 


Leu 


Lys 


Ser Val 


Leu Gly 






300 






Ser Asn 


Gly 


Ala 


Asp Leu 


Ser Gly 




O ID 






1 "> c\ 


Leu Ser 


Lys 


Ala 


Val His 


Lys Ala 


330 








335 


Thr Glu 


Ala 


Ala 


Gly Ala 


Met Phe 


345 






350 




Pro Pro 


Glu 


Val 


Lys Phe 


Asn Lys 








365 




Gin Asn 


Thr 


Lys 


Ser Pro 


Leu Phe 



380 



Gin Lys 

<210> 93 

<211> 1115 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> 5 * UTR 
<222> 1. .277 

<220> 
<221> CDS 
<222> 278. .733 

<220> 

<221> 3'UTR 
<222> 734. .1115 







35 




40 


Asn 


He 


Phe 


Phe Ser Pro Val 


Ser 




50 




55 




Ser 


Leu 


Gly 


Thr Lys Ala Asp 


Thr 


65 






70 




Asn 


Phe 


ABn 


Leu Thr Glu He 


Pro 








85 




Gin 


Glu 


Leu 


Leu Arg Thr Leu 


Asn 








100 




Thr 


Thr 


Gly 


Asn Gly Leu Phe 


Leu 






115 




120 


Lys 


Phe 


Leu 


Glu Asp Val Lys 


Lys 




130 




135 




Val 


Asn 


Phe 


Gly Asp Thr Glu 


Glu 


145 






150 




Val 


Glu 


Lys 


Gly Thr Gin Gly 


Lys 








165 




Asp Arg 


Asp 


Thr Val Phe Ala 


Leu 








180 




Lys 


Trp 


Glu 


Arg Pro Phe Glu 


Val 






195 




200 


His 


Val 


Asp 


Gin Ala Thr Thr 


Val 




210 




215 




Gly Met 


Phe 


Asn He Gin His 


Cys 


225 






230 




Leu 


Met 


Lys 


Tyr Leu Gly Asn 


Ala 








245 




Glu Gly 


Lys 


Leu Gin His Leu 


Glu 








260 




Thr Lys 


Phe 


Leu Glu Asn Glu 


Asp 






275 




280 


Pro Lys 


Leu 


Ser He Thr Gly 


Thr 




290 




295 




Gin 


Leu 


Gly 


He Thr Lys Val 


Phe 


305 






310 




Val 


Thr 


Glu 


Glu Ala Pro Leu 


Lys 








325 




Val 


Leu 


Thr 


He Asp Glu Lys 


Gly 








340 




Leu 


Glu 


Ala 


He Pro Met Ser 


lie 






355 




360 


Pro 


Phe 


Val 


Phe Leu Met He 


Glu 




370 




375 




Met Gly 


Lys 


Val Val Asn Pro 


Thr 


385 






390 
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<220> 

<221> polyA_signal 
<222> 1072. .1077 

<220> 

<221> polyA_site 
<222> 1101. .1115 

<400> 93 

ctctttgctc taacagacag cagcgacttt aggctggata atagtcaaat tcttacctcg 60 
ctctttcact gctagtaaga tcagattgcg tttctttcag ttactcttca atcgccagtt 120 
tcttgatctg cttctaaaag aagaagtaga gaagataaat cctgtcttca atacctggaa 180 
ggaaaaacaa aataacctca actccgtttt gaaaaaaaca ttccaagaac tttcatcaga 240 
gattttactt agatgattta cacaatgaag aaagtac atg cac ttt ggg ctt ctg 295 

Met His Phe Gly Leu Leu 
-15 



tec 


ctg 


ctg 


ctt 


aat 


ctt 


gee 


cct 


gec 


cct 


ctt 


aat 


get 


gat 


tct 


gag 


343 


Ser 


Leu 


Leu 
-10 


Leu 


Asn 


Leu 


Ala 


Pro 
-5 


Ala 


Pro 


Leu 


Asn 


Ala 
1 


Asp 


Ser 


Glu 




gaa 


gau 


gaa 


gaa 


cac 


aca 


att 


ate 


aca 


gat 


acg 


gag 


ttg 


cca 


cca 


ctg 


391 


Glu 


Asp 


Glu 


Glu 


His 


Thr 


He 


He 


Thr 


Asp 


Thr 


Glu 


Leu 


Pro 


Pro 


Leu 




5 










10 










15 










20 




aaa 


ctt 


atg 


cat 


tea 


ttt 


tgt 


gca 


ttc 


aag 


gcg 


gat 


gat 


age 


cca 


tgt. 


439 


Lys 


Leu 


Met 


His 


Ser 
25 


Phe 


Cys 


Ala 


Phe 


Lys 
30 


Ala 


Asp 


Asp 


Ser 


Pro 
35 


Cys 




aaa 


gca 


ate 


atg 


aaa 


aga 


ttt 


ttc 


ttc 


aat 


att 


ttc 


act 


cga 


cag 


tgc 


487 


Lys 


Ala 


He 


Met 
40 


Lys 


Arg 


Phe 


Phe 


Phe 
45 


Asn 


He 


Phe 


Thr 


Arg 
50 


Gin 


Cys 




gaa 


gaa 


ttt 


ata 


tat 


ggg 


gga 


tgt 


gaa 


gga 


aat 


cag 


aat 


cga 


ttt 


gaa 


535 


Glu 


Glu 


Phe 
55 


lie 


Tyr 


Gly 


Gly 


Cys 
60 


Glu 


Gly 


Asn 


Gin 


Asn 
65 


Arg 


Phe 


Glu 




agt 


ctg 


gaa 


gag 


tgc 


aaa 


aaa 


atg 


tgt 


aca 


aga 


gat 


aat 


gca 


aac 


agg 


583 


Ser 


Leu 
70 


Glu 


Glu 


Cys 


Lys 


Lys 
75 


Met 


Cys 


Thr 


Arg 


Asp 
80 


Asn 


Ala 


Asn 


Arg 




att 


ata 


aag 


aca 


aca 


ttg 


caa 


caa 


gaa 


aag 


cca 


gat 


ttc 


tgc 


ttt 


ttg 


631 


He 


He 


Lys 


Thr 


Thr 


Leu 


Gin 


Gin 


Glu 


Lys 


Pro 


Asp 


Phe 


Cys 


Phe 


Leu 




85 










90 










95 










100 




gaa 


gaa 


gat 


cct 


gga 


ata 


tgt 


cga 


ggt 


tat 


att 


ace 


agg 


tat 


ttt 


tat 


679 


Glu 


Glu 


Asp 


Pro 


Gly 
105 


He 


Cys 


Arg 


Gly 


Tyr 
110 


He 


.Thr 


Arg 


Tyr 


Phe 
115 


Tyr 




aac 


aat 


cag 


aca 


aaa 


cat 


gtg 


aac 


gtt 


tea 


agt 


atg 


gtg 


gat 


gee 


tgg 


727 


Asn 


Asn 


Gin 


Thr 
120 


Lys 


His 


Val 


Asn 


Val 
125 


Ser 


Ser 


Met 


Val 


Asp 
130 


Ala 


Trp 




gca 


ata 


tgaacaattt tgagacactg gaagaatgea agaacatttg 


tgaagatggt 


783 



Ala He 

ccgaatggtt tccaggtgga taattatgga acccagctca atgctgtgaa taactccctg 843 

actccgcaat caaccaaggt tcccagcctt tttgttacaa aagaaggaac aaatgatggt 903 

tggaagaatg eggctcatat ttaccaagtc tttctgaacg ccttctgcat tcatgcatcc 963 

atgttctttc taggattgga tagcatttca tgcctatgtt aatatttgtg cttttggcat 1023 

ttccttaata tttatatgta tacgtgatgc ctttgatagc atactgetaa taaagtttta 1083 

atatttacat gcataggaaa aaaaaaaaaa aa 1115 

<210> 94 
<211> 152 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> 1. .19 

<400> 94 
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i Y iec 


HIS 


File Giy 


Leu 
-15 


Leu 


Ser 


Leu 


Leu 


Leu 
-10 


Asn Lieu 


ft! a 
Ala 


Pro 


Ala 
-5 


Pro 




Ash 


Ala 7\ c; y*v 
1 


Cor 




VjJLU 


Asp 
5 


Ui U 


V31U 


nio xin 


lie 

10 


lie 


111 XV 


Asp 


nix 


15 


JJcU riu 


Pro 


Leu 


Lys 
20 


Leu 


1*1 C L. 


XX-J a 

nio 


OCX, XT 11C 

25 




Ala. 


rile 


Lys 


Al a 


A en 
nop 


A an Coy* 
J-\.o }J Del 


"D YV\ 








Tl » 

lie 


1*1 c l» 


Lys Arg 


XT 11C 


rile 


rile 


Asn 


30 








35 










40 








45 


1 J. C 


IT I1C 


i iii- m, y 


nl n 
-? u 


Cys 


rsl n 

vJ-L U 




Jrlle 


lie 


iyi 01/ 


nl tt 


Cys 


o u 


vjiy 


■Moll 


bin 


ash Hig 
65 


Jrlle 


blU 


Ser 


Leu 


m ii 
70 


olU 




Lys 


wee 
75 


Cys 


Thr 


Arg 


Asp 


Asn A±a 
80 


Asn 


Arg 


lie 


lie 
85 


Lys 


inr 


im Leu 


vj±n 
90 


Gin 


GlU 


Lys 


Pro 


Asp 
95 


Phe Cys 


Phe 


Leu 


Glu 
100 


Glu 


Asp 


Pro 


Gly He 
105 


Cys 


Arg 


Gly 


Tyr 


He 


Thr 


Arg Tyr 


Phe 


Tyr 


Asn 


Asn 


Gin 


Thr 


Lys His 


Val 


Asn 


Val 


Ser 


110 








115 










120 








125 


Ser 


Met 


Val Asp 


Ala 
130 


Trp 


Ala 


He 

















<210> 95 
<211> 1307 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> 5'UTR 
<222> 1. .252 

<220> 
<221> CDS 
<222> 253. .744 

<220> 

<221> 3'UTR 
<222> 745. .1307 

<220> 

<221> polyA_signal 
<222> 1269. .1274 

<220> 

<221> polyA_site 
<222> 1292. .1307 

<400> 95 

ctctttgctc taacagacag cagegacttt aggctggata atagtcaaat tcttacctcg 60 
ctctttcact gctagtaaga teagattgeg tttctttcag ttactcttca ategecagtt 120 
tcttgatctg cttctaaaag aagaagtaga gaagataaat cctgtcttca atacctggaa 180 
ggaaaaacag aataacctca actccgtttt gaaaaaaaca ttccaagaac tttcatcaga 240 
gattttactt ag atg att tac aca atg aag aaa gta cat gca ctt tgg get 291 
Met He Tyr Thr Met Lys Lys Val His Ala Leu Trp Ala 
-25 -20 -15 



tct gta tgc ctg ctg 


ctt 


aat 


ctt 


gec cct 


gec 


cct 


ctt aat get gat 


Ser Val Cys Leu Leu 


Leu 


Asn 


Leu 


Ala Pro 


Ala 


Pro 


Leu Asn Ala Asp 


-10 








-5 






1 


tct gag gaa gat gaa 


gaa 


cac 


aca 


att ate 


aca 


gat 


acg gag ttg cca 


Ser Glu Glu Asp Glu 


Glu 


His 


Thr 


He lie 


Thr 


Asp 


Thr Glu Leu Pro 


5 






10 








15 


cca ctg aaa ctt atg 


cat 


tea 


ttt 


tgt gca 


ttc 


aag 


gcg gat gat ggc 


Pro Leu Lys Leu Met 


His Ser Phe Cys Ala 


Phe 


Lys 


Ala Asp Asp Gly 


20 




25 








30 
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cca 


tgt 


aaa 


gca 


ate 


atg 


aaa 


aga 


ttt 


ttc 


ttc 


aat 


att 


ttc 


act 


cga 


483 


Pro 


Cys 


Lys 


Ala 


He 


Met 


Lys 


Arg 


Phe 


Phe 


Phe 


Asn 


He 


Phe 


Thr 


Arg 




35 










40 










45 










50 




cag 


tgc 


gaa 


gaa 


ttt 


ata 


tat 


ggg 


gga 


tgt 


gaa 


gga 


aat 


cag 


aat 


cga 


531 


Gin 


Cys 


Glu 


Glu 


Phe 


He 


Tyr 


Gly 


Gly 


Cys 


Glu 


Gly 


Asn 


Gin 


Asn 


Arg 












55 










60 










65 






tut 


gaa 


agt 


ctg 


gaa 


gag 


tgc 


aaa 


aaa 


atg 


tgt 


aca 


aga 


gat 


aat 


gca 


579 


Phe 


Glu 


Ser 


Leu 


Glu 


Glu 


Cys 


Lys 


Lys 


Met 


Cys 


Thr 


Arg 


Asp 


Asn 


Ala 










70 










75 










80 








aac 


agg 


att 


ata 


aag 


aca 


aca 


ttg 


caa 


caa 


gaa 


aag 


cca 


gat 


ttc 


tgc 


627 


Asn 


Arg 


He 


He 


Lys 


Thr 


Thr 


Leu 


Gin 


Gin 


Glu 


Lys 


Pro 


Asp 


Phe 


Cys 








85 










90 










95 










ttt 


ttg 


gaa 


gaa 


gat 


cct 


gga 


ata 


tgt 


cga 


ggt 


tat 


att 


acc 


agg 


tat 


675 


Phe 


Leu 


Glu 


Glu 


Asp 


Pro 


Gly 


He 


Cys 


Arg 


Gly 


Tyr 


He 


Thr 


Arg 


Tyr 





100 105 110 

ttt tat aac aat cag aca aaa cag tgt gaa cgt ttc aag tat ggt gga 723 
Phe Tyr Asn Asn Gin Thr Lys Gin Cys Glu Arg Phe Lys Tyr Gly Gly 
115 120 125 " 130 

tgc ctg ggc aat caa caa ttt tgagacactg gaacaatgea agaacatttg 774 
Cys Leu Gly Asn Gin Gin Phe 
135 

tgaagatggt ccgaatggtt tccaggtgga taattatgga acccagctca atgctgtgaa 834 
taactccctg actccgcaat caaccaaggt tcccagcctt tttgaatttc acggtccctc 894 
atggtgtctc actccagcag acagaggatt gtgtcgtgcc aatgagaaca gattctacta 954 
caattcagtc attgggaaat gccgcccatt taagtacagt ggatgtgggg gaaatgaaaa 1014 
caattttact tccaaacaag aatgtctgag ggcatgtaaa aaaggtttca tccaaagaat 1074 
atcaaaagga ggectaatta aaaccaaaag aaaaagaaag aagcagagag tgaaaatagc 1134 
atatgaagaa atttttgtta aaaatatgtg aatttgttat agcaatgtaa cattaattct 1194 
actaaatatt ttatatgaaa tgtttcacta tgattttcta tttttcttct aaaatgcttt 1254 
taattaatat gttcattaaa ttttctatgc ttattgcaaa aaaaaaaaaa aaa 1307 

<210> 96 

<211> 164 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 
<222> 1. .28 



<400> 96 



Met 


He 


Tyr 


Thr 
-25 


Met 


Lys 


Lys 


Val 


Leu 


Leu 


Leu 
-10 


Asn 


Leu 


Ala 


Pro 


Ala 
-5 


Asp 


Glu 


Glu 


His 


Thr 


He 


He 


Thr 


5 










10 






Leu 


Met 


His 


Ser 


Phe 
25 


Cys 


Ala 


Phe 


Ala 


He 


Met 


Lys 
40 


Arg 


Phe 


Phe 


Phe 


Glu 


Phe 


He 
55 


Tyr 


Gly 


Gly 


Cys 


Glu 
60 


Leu 


Glu 
70 


Glu 


Cys 


Lys 


Lys 


Met 
75 


Cys 


He 


Lys 


Thr 


Thr 


Leu 


Gin 


Gin 


Glu 


85 










90 






Glu 


Asp 


Pro 


Gly 


He 
105 


Cys 


Arg 


Gly 


Asn 


Gin 


Thr 


Lys 
120 


Gin 


Cys 


Glu 


Arg 


Asn 


Gin 


Gin 


Phe 











135 



His 


Ala 


Leu 


Trp 


Ala 


Ser 


Val 


Cys 


-20 










-15 






Pro 


Leu 


Asn 


Ala 


Asp 


Ser 


Glu 


Glu 


Asp 


Thr 


Glu 


Leu 


1 

Pro 


Pro 


Leu 


Lys 






15 










20 


Lys 


Ala 


Asp 


Asp 


Gly 


Pro 


Cys 


Lys 




30 










35 




Asn 


He 


Phe 


Thr 


Arg 


Gin 


Cys 


Glu 


45 










50 






Gly 


Asn 


Gin 


Asn 


Arg 


Phe 


Glu 


Ser 










65 








Thr 


Arg 


Asp 


Asn 


Ala 


Asn 


Arg 


He 








80 










Lys 


Pro 


Asp 


Phe 


Cys 


Phe 


Leu 


Glu 






95 










100 


Tyr 


He 


Thr 


Arg 


Tyr 


Phe 


Tyr Asn 




110 










115 




Phe 


Lys 


Tyr 


Gly 


Gly 


Cys 


Leu Gly 


125 










130 
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<210> 97 

<211> 1855 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> 5'UTR 
<222> 1. .117 



<220> 
<221> CDS 
<222> 118. .504 



<220> 

<221> 3 , UTR 
<222> 505. .1855 



<220> 

<221> polyA_signal 
<222> 1819. .1824 



<220> 

<221> polyA_site 
<222> 1840. .1855 



<400> 97 

tccccggccg ccgccgttgc gctcgccgcg ctcgcactga agcccgggcc ctcgcgcgcc 60 
gcggttcgcc ccgcagcctc gccccctgcc cacccgggcg gccgtagggc ggtcacg 117 



atg 


ctg 


ccg 


ccc 


tta 


ccc 


tec 


cgc 


etc 


ggg 


ctg 


ctg 


ctg 


ctg 


ctg 


etc 


165 


Met 


Leu 


Pro 


Pro 


Leu 


Pro 


Ser 


Arg 


Leu 


Gly 


Leu 


Leu 


Leu 


Leu 


Leu 


Leu 










-20 










-15 










-10 








ctg 


tgc 


ccg 


gcg 


cac 


gtc 


ggc 


gga 


ctg 


tgg 


tgg 


get 


gtg 


ggc 


age 


ccc 


213 


Leu 


Cys 


Pro 
-5 


Ala 


His 


Val 


Gly 


Gly 
1 


Leu 


Trp 


Trp 


Ala 
5 


Val 


Gly 


Ser 


Pro 




ttg 


gtt 


atg 


gac 


cct 


acc 


age 


ate 


tgc 


agg 


aag 


gca 


egg 


egg 


ctg 


gec 


261 


Leu 


Val 


Met 


Asp 


Pro 


Thr 


Ser 


He 


Cys 


Arg 


Lys 


Ala 


Arg 


Arg 


Leu 


Ala 




10 










15 










20 










25 




ggg 


egg 


cag 


gec 


gag 


ttg 


tgc 


cag 


get 


gag 


ccg 


gaa 


gtg 


gtg 


gca 


gag 


309 


Gly 


Arg 


Gin 


Ala 


Glu 


Leu 


Cys 


Gin 


Ala 


Glu 


Pro 


Glu 


Val 


Val 


Ala 


Glu 












30 










35 










40 






ctg 


get 


egg 


ggc 


gec 


egg 


etc 


ggg 


gtg 


cga 


gag 


tgc 


cag 


ttc 


cag 


ttc 


357 


Leu 


Ala 


Arg 


Gly 


Ala 


Arg 


Leu 


Gly 


Val 


Arg 


Glu 


Cys 


Gin 


Phe 


Gin 


Phe 










45 










50 










55 








cgc 


ttc 


cgc 


cgc 


tgg 


aat 


tgc 


tec 


age 


cac 


age 


aag 


gec 


ttt 


gga 


cgc 


405 


Arg 


Phe 


Arg 


Arg 


Trp 


Asn 


Cys 


Ser 


Ser 


His 


Ser 


Lys 


Ala 


Phe 


Gly 


Arg 








60 










65 










70 










ate 


ctg 


caa 


cag 


ggt 


cag 


tgt 


ggg 


gag 


ggg 


gcg 


gaa 


gtg 


ggg 


ctg 


ctt 


453 


He 


Leu 


Gin 


Gin 


Gly 


Gin 


Cys 


Gly 


Glu 


Gly 


Ala 


Glu 


Val 


Gly 


Leu 


Leu 






75 










80 










85 












tct 


ccc 


tgc 


tgt 


ggg 


acc 


cga 


gga 


gag 


gag 


aac 


tgg 


ttc 


get 


gaa 


gtt 


501 


Ser 


Pro 


Cys 


Cys 


Gly 


Thr 


Arg 


Gly 


Glu 


Glu 


Asn 


Trp 


Phe 


Ala 


Glu 


Val 




90 










95 










100 










105 





gec tgagccccac ttccccctca catgtgtctg ggcaccctgc aaggaccctg 554 
Ala 

cctcccaggc ccctggggca gccctcccgc cgcaggtttc aggtcccagg ccccagctga 614 

ccgccccagc ccgcgctgat tgcacctgtc tgcattcaca gaeatteggg agaeggcett 674 

cgtgttcgcc ateactgegg ccggcgccag ccacgccgtc acgcaggcct gttctatggg 734 

egagctgetg cagtgegget gccaggcgcc ccgcgggcgg gcccctcccc ggccctccgg 794 

cctgcccggc acccccggac cccctggccc cgcgggctcc ceggaaggea gcgccgcctg 854 

ggagtgggga ggctgcggcg acgacgtgga etteggggae gagaagtcga ggctctttat 914 

ggacgegegg cacaageggg gaegeggaga catccgcgcg ttggtgcaac tgcacaacaa 974 

egaggeggge aggctggccg tgeggageca cacgcgcacc gagtgcaaat gccacgggct 1034 



90 
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gtcgggatca tgcgcgctgc 
cgcgcggctg ctggagcgct 
ggccctgctg cccgccgtcc 
cgccgattcg cccgacttct 
tcgcgcctgc aatagcagcg 
cgggcaccgc caggagagcg 
ctgcgtagta cagtgccacc 
cgccgcccgg ccgctagact 
gcaccggcac cgggcgcctc 
cccagggctc tggaaatggt 
cagggcgcca gacggccccg 
cctccctccc cttggcctct 
aaggcctctg gatactgggc 
tatcaataaa gatatttaaa 

<210> 98 
<211> 129 
<212> PRT 

<213> Homo sapiens 



<220> 








<221> 


SIGNAL 




<222> 


1. 


.24 




<400> 


98 




Met Leu 


Pro Pro Leu Pro 


Ser Arg Leu Gly Leu Leu Leu Leu Leu Leu 






-20 


-15 -10 


Leu Cys 


Pro Ala His Val 


Gly Gly Leu Trp Trp Ala Val Gly Ser Pro 






-5 


1 5 


Leu Val 


Met Asp Pro Thr 


Ser He Cys Arg Lys Ala Arg Arg Leu Ala 


10 




15 20 


Gly Arg 


Gin Ala Glu Leu 


Cys Gin Ala Glu Pro Glu Val Val Ala Glu 


25 




30 


35 40 


Leu Ala 


Arg Gly Ala Arg 


Leu Gly Val Arg Glu Cys Gin Phe Gin Phe 






45 


50 55 


Arg Phe 


Arg Arg Trp Asn 


Cys Ser Ser His Ser Lys Ala Phe Gly Arg 






60 


65 70 


He Leu 


Gin Gin Gly Gin 


Cys Gly Glu Gly Ala Glu Val Gly Leu Leu 






75 


80 85 


Ser Pro 


Cys Cys Gly Thr 


Arg Gly Glu Glu Asn Trp Phe Ala Glu Val 


90 




95 100 


Ala 








105 








<210> 


99 




<211> 


667 




<212> 


DNA 




<213> 


Homo sapiens 




<220> 








<221> 


5 


'UTR 




<222> 


1 


. .94 




<220> 








<221> 


CDS 




<222> 


95. .613 




<220> 








<221> 


3 


' UTR 




<222> 


614. .667 





<220> 

<221> polyA_signal 



gcacctgctg gcagaagctg cctccatttc gcgaggtggg 1094 
tccacggcgc ctcacgcgtc atgggcacca acgacggcaa 1154 
gcacgctcaa gccgccgggc cgagcggacc tcctctacgc 1214 
gcgcccccaa ccgacgcacc ggctcccccg gcacgcgcgg 1274 
ccccggacct cagcggctgc gacctgctgt gctgcggccg 1334 
tgcagctcga agagaactgc ctgtgccgct tccactggtg 1394 
gctgccgtgt gcgcaaggag ctcagcctct gcctgtgacc 1454 
gacttcgcgc agcggtggct cgcacctgtg ggacctcagg 1514 
tcgccgctcg agcccagcct ctccctgcca aagcccaact 1574 
gaggcgaggg gcttgagagg aacgcccacc cacgaaggcc 1634 
aaaaggcgct cggggagcgt ttaaaggaca ctgtacaggc 1694 
aggaggaaac agttttttag actggaaaaa agccagtcta 1754 
tccccagaac tgctggccac aggatggtgg gtgaggttag 1814 
ccaccaaaaa aaaaaaaaaa a 1855 
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<222> 636. .641 
<220> 

<221> polyA__site 
<222> 652. .667 

<400> 99 

ctctgcaaat ccaggacaca cattgtgctc cgcgctccac taaaggcttg agtgggcact 60 
gttccatctc aacagcccct gttttggaaa ggac atg att gtc aag ggg gtg gcc 115 

Met He Val Lys Gly Val Ala 



tec 


aga 


act 


gtg 


I-T+- 4- 

gtt 


tec 


aga 


ccg 


4- 4» j-t 

tec 


1 

ccc 


ggt 


aac 


tgg 


5 

ctt 


ttc tct 


163 


Ser* 




11117 

10 


Val 


vax 


oer 


Arg 


Pro 
15 


Fne 


Pro 


oiy 


Asn 


Trp 
20 


Leu 


Phe Ser 




tec 


acc 


cag 


ctg 


act 


gac 


gat 


cag 


ggc 


ccc 


gtc 


ctg 


atg 


acc 


act gta 


211 


Ser 


lie 

ZD 




Leu 


Thr 


Asp 


ASp 

3 0 


Gin 


pi,. 

Gly 


Pro 


Val 


Leu 
35 


Met 


Thr 


Tnr Val 




gcc 


acg 


cct 


gtg 


4-4-4- 

ttc 


age 


aag 


cag 


aac 


gaa 


acc 


aga 


teg 


aag 


ggc att 


259 




Mot* 

net 


Pro 


vajL 


irne 


Ser 


Lys 


Gin 


Asn 


G1U 


rpl_ 

Tnr 


Arg 


Ser 


Lys 


Gly He 




40 










45 










50 








55 




Cll 


ctg 


gga 


— 1_ 

gtg 


gec 


ggc 


aca 


gat 


gtc 


cca 


gtg 


aaa 


gaa 


ctt 


ctg aag 


307 


Leu 


T All 

Lieu 


Gly 


val 


vai 
60 


Gly 


Thr 


Asp 


Val 


Pro 
65 


Val 


Lys 


Glu 


Leu 


Leu Lys 
70 




acc 


ate 


ccc 


aaa 


tac 


aag 


tta 


ggg 


att 


cac 


ggt 


tat 


gcc 


ttt 


gca ate 


355 


Thr 


He 


Pro 


Lys 
75 


Tyr 


Lys 


Leu 


Gly 


He 
80 


His 


Gly 


Tyr 


Ala 


Phe 
85 


Ala He 




aca 


aat 


aat 


gga 


tat 


ate 


ctg 


acg 


cat 


ccg 


gaa 


etc 


agg 


ctg 


ctg tac 


403 


Thr 


Asn 


Asn 
90 


Gly 


Tyr 


He 


Leu 


Thr 
95 


His 


Pro 


Glu 


Leu 


Arg 
100 


Leu 


Leu Tyr 




gaa 


gaa 


gga 


aaa 


aag 


cga 


agg 


aaa 


cct 


aac 


tat 


agt 


age 


gtt 


gac etc 


451 


Glu 


Glu 
105 


Gly 


Lys 


Lys 


Arg 


Arg 
110 


Lys 


Pro 


Asn 


Tyr 


Ser 
115 


Ser 


Val 


Asp Leu 




tct 


gag 


gtg 


gag 


tgg 


gaa 


gac 


cga 


gat 


gac 


gtg 


ttg 


aga 


aat 


get atg 


499 


Ser 


Glu 


Val 


Glu 


Trp 


Glu 


Asp 


Arg 


Asp 


Asp 


Val 


Leu 


Arg 


Asn 


Ala Met 




120 










125 










130 








135 




gtg 


aat 


cga 


aag 


acg 


ggg 


aag 


ttt 


tec 


atg 


gag 


gtg 


aag 


aag 


aca gtg 


547 


val 


Asn 


Arg 


Lys 


Thr 
14 0 


Gly 


Lys 


Phe 


Ser 


Met 
145 


Glu 


Val 


Lys 


Lys 


Thr Val 
150 




gac 


aaa 


ggg 


gta 


cat 


ttt 


tct 


caa 


aca 


ttt 


ttg 


ctg 


ctt 


aat 


tta aaa 


595 


Asp 


Lys 


Gly 


Val 
155 


His 


Phe 


Ser 


Gin 


Thr 
160 


Phe 


Leu 


■Leu 


Leu 


Asn 
165 


Leu Lys 





caa acc act gtg aaa aat tagctttgaa agctatatct ggaataaata 643 
Gin Thr Thr Val Lys Asn 
170 

tetttegcaa aaaaaaaaaa aaaa 667 



<210> 100 
<211> 173 
<212> PRT 

<213> Homo sapiens 



<400> 100 



Met He 


Val 


Lys 


Gly 


Val 


Ala 


Ser Arg 


Thr 


Val 


Val 


Ser 


Arg 


Pro Phe 


1 






5 








10 










15 


Pro Gly 


Asn 


Trp 


Leu 


Phe 


Ser 


Ser He 


Gin 


Leu 


Thr 


Asp 


Asp 


Gin Gly 






20 








25 










30 




Pro Val 


Leu 


Met 


Thr 


Thr 


Val 


Ala Met 


Pro 


Val 


Phe 


Ser 


Lys 


Gin Asn 




35 










40 








45 






Glu Thr 


Arg 


Ser 


Lys 


Gly 


He 


Leu Leu 


Gly 


Val 


Val 


Gly 


Thr 


Asp Val 


50 










55 








60 








Pro Val 


Lys 


Glu 


Leu 


Leu 


Lys 


Thr He 


Pro 


Lys 


Tyr 


Lys 


Leu 


Gly He 


65 








70 








75 








80 


His Gly 


Tyr 


Ala 


Phe 


Ala 


He 


Thr Asn 


Asn 


Gly 


Tyr 


He 


Leu 


Thr His 
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85 90 95 

Pro Glu Leu Arg Leu Leu Tyr Glu Glu Gly Lys Lys Arg Arg Lys Pro 

100 105 110 

Asn Tyr Ser Ser Val Asp Leu Ser Glu Val Glu Trp Glu Asp Arg Asp 

115 120 125 

Asp Val Leu Arg Asn Ala Met Val Asn Arg Lys Thr Gly Lys Phe Ser 

130 135 140 

Met Glu Val Lys Lys Thr Val Asp Lys Gly Val His Phe Ser Gin Thr 
145 150 155 160 

Phe Leu Leu Leu Asn Leu Lys Gin Thr Thr Val Lys Asn 
165 170 

<210> 101 
<211> 1062 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> 5'UTR 
<222> 1. .153 

<220> 
<221> CDS 
<222> 154. .639 

<220> 

<221> 3'UTR 
<222> 640. .1062 

<220> 

<221> polyA__signal 
<222> 1023. .1028 

<220> 

<221> polyA_site 
<222> 1047. .1062 

<400> 101 

attggtgtat ggctttgcag caataactga tggctgtttc ccctcctgct ttatctttca 60 . 
gttaatgacc agccacggcg tccctgctgt gagctctggc cgctgccttc cagggctccc 120 
gagccacacg ctgggggtgc tggctgaggg aac atg get tgt tgg cct cag ctg 174 

Met Ala Cys Trp Pro Gin Leu 



agg 


ttg 


ctg 


ctg 


tgg 


aag 


aac 


etc 


act 


1 
ttc 


aga 


aga 


aga 


5 
caa 


aca 


tgt 


222 


Arg 


Leu 


Leu 


Leu 


Trp 


Lys 


Asn 


Leu 


Thr 


Phe 


Arg 


Arg 


Arg 


Gin 


Thr 


Cys 








10 










15 










20 










cag 


ctg 


ctg 


ctg 


gaa 


gtg 


gee 


tgg 


cct 


eta 


ttt 


ate 


ttc 


ctg 


ate 


ctg 


270 


Gin 


Leu 


Leu 


Leu 


Glu 


Val 


Ala 


Trp 


Pro 


Leu 


Phe 


lie 


Phe 


Leu 


He 


Leu 






25 










30 










35 












ate 


tct 


gtt 


egg 


ctg 


age 


tac 


cca 


ccc 


tat 


gaa 


caa 


cat 


gaa 


tgc 


cat 


318 


lie 


Ser 


Val 


Arg 


Leu 


Ser 


Tyr 


Pro 


Pro 


Tyr 


Glu 


Gin 


His 


Glu 


Cys 


His 




40 










45 










50 








55 




ttt 


cca 


aat 


aaa 


gee 


atg 


ccc 


tct 


gca 


gga 


aca 


ctt 


cct 


tgg 


gtt 


cag 


366 


Phe 


Pro 


Asn 


Lys 


Ala 


Met 


Pro 


Ser 


Ala 


Gly 


Thr 


Leu 


Pro 


Trp 


Val 


Gin 












60 










65 










70 






ggg 


att 


ate 


tgt 


aat 


gee 


aac 


aac 


ccc 


tgt 


ttc 


cgt 


tac 


ccg 


act 


cct 


414 


Gly 


He 


He 


Cys 


Asn 


Ala 


Asn 


Asn 


Pro 


Cys 


Phe 


Arg 


Tyr 


Pro 


Thr 


Pro 










75 










80 










85 








ggg 


gag 


get 


ccc 


gga 


gtt 


gtt 


gga 


aac 


ttt 


aac 


aaa 


tec 


att 


gtg 


get 


462 


Gly 


Glu 


Ala 


Pro 


Gly 


Val 


Val 


Gly 


Asn 


Phe 


Asn 


Lys 


Ser 


He 


val 


Ala 








90 










95 








100 










cgc 


ctg 


ttc 


tea 


gat 


get 


egg 


agg 


ctt 


ctt 


tta 


tac 


age 


cag 


aaa 


gac 


510 


Arg 


Leu 


Phe 


Ser 


Asp 


Ala 


Arg 


Arg 


Leu 


Leu 


Leu 


Tyr 


Ser 


Gin 


Lys 


Asp 
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105 110 115 

acc age atg aag gac atg cgc aaa gtt ctg aga aca tta cag cag ate 558 
Thr Ser Met Lys Asp Met Arg Lys Val Leu Arg Thr Leu Gin Gin lie 
120 125 130 135 

aag aaa tec age tea aga ggg gac aaa cgc cat ttc etc aac tgg cag 606 
Lys Lys Ser Ser Ser Arg Gly Asp Lys Arg His Phe Leu Asn Trp Gin 

140 145 150 

aag gga ctg aag cct etc cct caa gee ctt tta taggggtcct cattgtcagg 659 
Lys Gly Leu Lys Pro Leu Pro Gin Ala Leu Leu 

155 160 
cctctaagcc caagccaagc catcgcatcc cctgtgactt gcacatatac geccagatgg 719 
cctgaagtaa ctgaagaatc acaaaagaag tgaaaaggce ctgcctcgcc ttaactgatg 779 
acgttccacc attgtgattt gttcctgccc caccttaact gagtgattaa ccctgtgaat 839 
ttccttctcc tggctcagaa gctcccccac tgagcacctt gtgaccccct gcccctgccc 899 
accagagaac aacccccttt gactgtaatt ttccattacc ttcccaaatc ctataaaacg 959 
gccccacccc tatctccctt tgetgactet etttteggae tcagcccacc tgeagecagg 1019 
tgaaaaaaac agctttattg ctcacacaaa aaaaaaaaaa aaa 1062 

<210> 102 
<211> 162 
<212> PRT 

<213> Homo sapiens 
<400> 102 

Met Ala Cys Trp Pro Gin Leu Arg Leu Leu Leu Trp Lys Asn Leu Thr 

15 10 15 

Phe Arg Arg Arg Gin Thr Cys Gin Leu Leu Leu Glu Val Ala Trp Pro 

20 25 30 

Leu Phe He Phe Leu He Leu He Ser Val Arg Leu Ser Tyr Pro Pro 

35 40 45 

Tyr Glu Gin His Glu Cys His Phe Pro Asn Lys Ala Met Pro Ser Ala 

50 55 60 

Gly Thr Leu Pro Trp Val Gin Gly He He Cys Asn Ala Asn Asn Pro 
65 70 75 80 

Cys Phe Arg Tyr Pro Thr Pro Gly Glu Ala Pro Gly Val Val Gly Asn 

85 90 95 

Phe Asn Lys Ser He Val Ala Arg Leu Phe Ser Asp Ala Arg Arg Leu 

100 105 110 

Leu Leu Tyr Ser Gin Lys Asp Thr Ser Met Lys Asp Met Arg Lys Val 

115 120 125 

Leu Arg Thr Leu Gin Gin He Lys Lys Ser Ser Ser Arg Gly Asp Lys 

130 135 14 0 

Arg His Phe Leu Asn Trp Gin Lys Gly Leu Lys Pro Leu Pro Gin Ala 
145 150 155 160 

Leu Leu 



<210> 103 
<211> 933 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> 5'UTR 
<222> 1..149 



<220> 
<221> CDS 
<222> 150. .392 



<220> 

<221> 3'UTR 
<222> 393. .933 
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<220> 

<221> polyA_site 
<222> 63. .933 

<400> 103 

aaaccctcag ggacctggta tagacgcaga atctgtttca cacaacaact gctatttgaa 60 
ggaaaaaaaa aaaaagaagc aaatgatacc aagacaagct cataacagag atccaatcag 120 
cagatgtgta cggatgaaaa tacagtgag atg agt cag aaa ccg gcc aag gag 173 

Met Ser Gin Lys Pro Ala Lys Glu 

1 5 

ggt ccc aga etc tec aaa aac cag aag tac tec gaa cac ttc age ata 221 
Gly Pro Arg Leu Ser Lys Asn Gin Lys Tyr Ser Glu His Phe Ser He 

10 15 20 

cac tgc tgc ccg ccg ttc acc ttc etc aat tec aag aag gag ata gtg 269 
His Cys Cys Pro Pro Phe Thr Phe Leu Asn Ser Lys Lys Glu He Val 
25 30 35 40 

gat egg aaa tac age ate tgt aag age ggc tgc ttc tac cag aag aaa 317 
Asp Arg Lys Tyr Ser He Cys Lys Ser Gly Cys Phe* Tyr Gin Lys Lys 

45 50 55 

gag gag gac tgg ate tgc tgc gcc tgc cag aag acc aga ttg aaa agg 365 
Glu Glu Asp Trp He Cys Cys Ala Cys Gin Lys Thr Arg Leu Lys Arg 

60 65 70 

aag ate agg cca acc cca aag aag aag tgaccaagga ggagtttaaa 412 
Lys He Arg Pro Thr Pro Lys Lys Lys 

75 80 
ytgaatgaac aacctcggct cctggactca ttgettcaca acccatctac ccctggatga 472 
agttatctgg cttcaaatat tatgeagggg caaacacctg ctgatgtggc aactgetgat 532 
gctcatggtc cccatggcat gggggectea gggcagcctg cctggagtac tttgaagatg 592 
tcatcccatt gtcttctgac ctctataatt gecactgaga gatctgetgt cagtctgett 652 
atccttccac ggactcaagt ttcttcaatc tgaagataca tgtctttctc caaggacatg 712 
tggaaaaaaa aaagatgtta tacaaccatc aaagtggcaa aaataaaaaa aattggctgg 772 
gcgtggtggc gggcgcctgt ggtcccagct actegggagg ctgaggcagg agaatggcgt 832 
gaacctggga ggeggagett geagtgagee gagategcac cactgcactc cagcctgggc 892 
gaeagagega gactctgtct caaacaaaaa aaaaaaaaaa a 933 

<210> 104 
<211> 81 
<212> PRT 

<213> Homo sapiens 
<400> 104 

Met Ser Gin Lys Pro Ala Lys Glu Gly Pro Arg Leu Ser Lys Asn Gin 

1 5 10 15 

Lys Tyr Ser Glu His Phe Ser He His Cys Cys Pro Pro Phe Thr Phe 

20 25 30 

Leu Asn Ser Lys Lys Glu He Val Asp Arg Lys Tyr Ser He Cys Lys 

35 40 45 

Ser Gly Cys Phe Tyr Gin Lys Lys Glu Glu Asp Trp He Cys Cys Ala 

50 55 60 

Cys Gin Lys Thr Arg Leu Lys Arg Lys He Arg Pro Thr Pro Lys Lys 
65 70 75 80 

Lys 

<210> 105 
<211> 1187 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> 5'UTR 
<222> 1. .34 

<220> 
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<221> CDS 
<222> 35. .1069 

<220> 

<221> 3'UTR 
<222> 1070. .1187 

<220> 

<221> polyA_signal 
<222> 1146. .1151 

<220> 

<221> polyA_site 
<222> 1172. .1187 

<400> 105 

accactttgg tagtgccagt gtgactcatc caca atg att tct cca gtg etc ate 55 

Met He Ser Pro Val Leu He 
-15 



ttg ttc 


teg 


agt 


ttt 


etc 


tgc 


cat 


gtt 


get 


att 


gca 


gga 


egg 


acc 


tgt 


103 


Leu 


Phe 
-10 


Ser 


Ser 


Phe 


Leu 


Cys 
-5 


His 


Val 


Ala 


He 


Ala 
1 


Gly 


Arg 


Thr 


Cys 
5 




ccc 


aag 


cca 


gat 


gat 


tta 


cca 


ttt 


tec 


aca 


gtg 


gtc 


ccg 


tta 


aaa 


aca 


151 


Pro Lys 


Pro 


Asp 


Asp 


Leu 


Pro 


Phe 


Ser 


Thr 


Val 


Val 


Pro 


Leu 


Lys 


Thr 












10 










15 










20 






ttc 


tat 


gag 


cca 


gga 


gaa 


gag 


att 


acg 


tat 


tec 


tgc 


aag 


ccg 


ggc 


tat 


199 


Phe Tyr 


Glu 


Pro 


Gly 


Glu 


Glu 


He 


Thr 


Tyr 


Ser 


Cys 


Lys 


Pro 


Gly 


Tyr 










25 










30 










35 








gtg tec 


cga 




ggg 


atg 


aga 


aag 


ttt 


ate 


tgc 


cct 


etc 


aca 


gga 


ctg 


247 


Val 


Ser 


Arg 

A f\ 

40 


Gly 


Gly 


Met 


Arg 


Lys 
45 


Phe 


He 


Cys 


Pro 


Leu 
50 


Thr 


Gly 


Leu 




tgg 


etc 


ate 


aac 


act 


ctg 


aaa 


tgt 


aca 


ccc 


aga 


gta 


tgt 


cct 


ttt 


get 


2 95 


Trp Leu 


± j.e 


Asn 


rn"l_ „ 

inr 


T All 

Leu 


T t rn 

Lys 


cys 


Thr 


Pro 


Arg 


Val 


Cys 


Pro 


Phe 


Ala 






55 










60 










65 












gga 


ate 


tta 


gaa 


aat 


gga 


gec 


gta 


cgc 


tat 


acg 


act 


ttt 


gaa 


tat 


ccc 


343 


Gly He 


Leu 


LjJ.U 


Asn 


<jj.y 


Ala 


vai 


Arg 


Tyr 


Thr 


Thr 


Phe 


Glu 


Tyr 


Pro 




70 










/ D 










o U 








B5 




aac 


acg 


ate 


agt 


ttt 


tct 


tgt 


aac 


act 


ggg 


ttt 


tat 


ctg 


aat 


ggc 


get 


391 


Asn 


Thr 


He 


Ser 


Phe 
90 


Ser 


Cys 


Asn 


Thr 


Gly 
95 


Phe 


Tyr 


Leu 


Asn 


Gly 
100 


Ala 




gat 


tct 


gec 


aag 


tgc 


act 


gag 


gaa 


gga 


aaa 


tgg 


age 


ccg 


gag 


ctt 


cct 


439 


Asp 


Ser 


Ala 


Lys 
105 


Cys 


Thr 


Glu 


Glu 


Gly 
110 


Lys 


Trp 


Ser 


Pro 


Glu 
115 


Leu 


Pro 




gtc tgt 


get 


ccc 


ate 


ate 


tgc 


cct 


cca 


cca 


tec 


ata 


cct 


acg 


ttt 


gca 


487 


Val 


Cys 


Ala 
120 


Pro 


He 


He 


Cys 


Pro 
125 


Pro 


Pro 


Ser 


He 


Pro 
130 


Thr 


Phe 


Ala 




aca 


ctt 


cgt 


gtt 


tat 


aag 


cca 


tea 


get 


gga 


aac 


aat 


tec 


etc 


tat 


egg 


535 


Thr 


Leu 


Arg 


Val 


Tyr 


Lys 


Pro 


Ser 


Ala 


Gly 


Asn 


Asn 


Ser 


Leu 


Tyr 


Arg • 






135 










140 










145 








gac" 


aca 


gca 


gtt 


ttt 


gaa 


tgt 


ttg 


cca 


caa 


cat 


gcg 


atg 


ttt 


gga 


aat 


583 


Asp 


Thr 


Ala 


Val 


Phe 


Glu 


Cys 


Leu 


Pro 


Gin 


His 


Ala 


Met 


Phe 


Gly 


Asn 




150 










155 










160 








165 




gat 


aca 


att 


acc 


tgc 


acg 


aca 


cat 


gga 


aat 


tgg 


act 


aaa 


tta 


cca 


gaa 


631 


Asp 


Thr 


He 


Thr 


Cys 
170 


Thr 


Thr 


His 


Gly 


Asn 
175 


Trp 


Thr 


Lys 


Leu 


Pro 
180 


Glu 




tgc 


agg 


gaa 


gta 


aaa 


tgc 


cca 


ttc 


cca 


tea 


aga 


cca 


gac 


aat 


gga 


ttt 


679 


Cys 


Arg 


Glu 


Val 


Lys 


Cys 


Pro 


Phe 


Pro 


Ser 


Arg 


Pro 


Asp 


Asn 


Gly 


Phe 










185 










190 








195 








gtg 


aac 


tat 


cct 


gca 


aaa 


cca 


aca 


ctt 


tat 


tac 


aag 


gat 


aaa 


gec 


aca 


727 


Val 


Asn 


Tyr 
200 


Pro 


Ala 


Lys 


Pro 


Thr 
205 


Leu 


Tyr 


Tyr 


Lys 


Asp 
210 


Lys 


Ala 


Thr 




ttt 


ggc 


tgc 


cat 


gat 


gga 


tat 


tct 


ctg 


gat 


ggc 


ccg 


gaa 


gaa 


ata 


gaa 


775 


Phe 


Gly 


Cys 


His 


Asp 


Gly 


Tyr 


Ser 


Leu 


Asp 


Gly 


Pro 


Glu 


Glu 


He 


Glu 
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215 220 225 



tgt 


acc 


aaa 


ctg 


gga 


aac 


tgg 


tct 


gec 


atg 


cca 


agt 


tgt 


aaa 


gca 


tct 


823 


Cys 


Thr 


Lys 


Leu 


Gly 


Asn 


Trp 


Ser 


Ala 


Met 


Pro 


Ser 


Cys 


Lys 


Ala 


Ser 




o o n 










235 










240 










245 




tgt 


aaa 


gta 


cct 


gtg 


aaa 


aaa 


gec 


act 


gtg 


gtg 


tac 


caa 


gga 


gag 


aga 


871 


Cys 


Lys 


vai 


Pro 


vai 


Lys 


Lys 


Ala 


Thr 


vai 


TT- "I 

vai 


Tyr 




Gly 


Glu 


Arg 












o c t\ 
2bU 










255 










O C f\ 

260 






gta 


aag 


att 


cag 


gaa 


aaa 


4- 4- 4- 
ttt 


aag 


aat 


gga 


atg 


eta 


— n 4- 

cat 


ggt 


gat 


aaa 


919 


vai 


Lys 


Tie* 

lie 


Gin 


G1U 


Lys 


pne 


Lys 


Asn 


Gly 


Met 


T A** 

Leu 


TJ-J r-l 
HIS 


Gly 


Asp 


Lys 










265 










270 










275 








gtt 


tct 


ttc 


ttc 


tgc 


aaa 


aat 


aag 


gaa 


aag 


aag 


tgt 


age 


tat 


aca 


gag 


967 


Val 


Ser 


Phe 


Phe 


Cys 


Lys 


Asn 


Lys 


Glu 


Lys 


Lys 


Cys 


Ser 


Tyr 


Thr 


Glu 








280 










285 










290 










gat 


get 


cag 


tgt 


ata 


gat 


ggc 


act 


ate 


gaa 


gtc 


ccc 


aaa 


tgc 


ttc 


aag 


1015 


Asp 


Ala 


Gin 


Cys 


lie 


Asp 


Gly 


Thr 


lie 


Glu 


Val 


Pro 


Lys 


Cys 


Phe 


Lys 






295 










300 










305 












gaa 


cac 


agt 


tct 


ctg 


get 


ttt 


tgg 


aaa 


act 


gat 


gca 


tec 


gat 


gta 


aag 


1063 


Glu 


His 


Ser 


Ser 


Leu 


Ala 


Phe 


Trp 


Lys 


Thr 


Asp 


Ala 


Ser 


Asp 


Val 


Lys 




310 










315 










320 










325 





cca tgc taaggtggtt ttcagattcc acataaaatg tcacacttgt ttcttgttca 1119 
Pro Cys 

tccaaggaac ctaattgaaa tttaaaaata aagctactga atttattgee gcaaaaaaaa 1179 
aaaaaaaa 1187 

<210> 106 
<211> 345 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> SIGNAL 
<222> 1. .19 



<400> 106 



Met 


lie 


Ser 


Pro 


Val 


Leu 


lie 


Leu 


Phe 


Ser 


Ser 


Phe 


Leu 


Cys 


His 


Val 










-15 










-10 










-5 




Ala 


lie 


Ala 


Gly 


Arg 


Thr 


Cys 


Pro 


Lys 


Pro 


Asp 


Asp 


Leu 


Pro 


Phe 


Ser 








1 








5 










10 








Thr 


Val 


Val 


Pro 


Leu 


Lys 


Thr 


Phe 


Tyr 


Glu 


Pro 


Gly 


Glu 


Glu 


He 


Thr 




15 










20 










25 










Tyr 


Ser 


Cys 


Lys 


Pro 


Gly 


Tyr 


Val 


Ser 


Arg 


Gly 


Gly 


Met 


Arg 


Lys 


Phe 


30 










35 










40 










45 


lie 


Cys 


Pro 


Leu 


Thr 


Gly 


Leu 


Trp 


Leu 


He 


Asn 


Thr 


Leu 


Lys 


Cys 


Thr 










50 










55 










60 




Pro 


Arg 


Val 


Cys 


Pro 


Phe 


Ala 


Gly 


He 


Leu 


Glu 


Asn 


Gly 


Ala 


Val 


Arg 








65 










70 










75 






Tyr 


Thr 


Thr 


Phe 


Glu 


Tyr 


Pro 


Asn 


Thr 


He 


Ser 


Phe 


Ser 


Cys 


Asn 


Thr 






80 










85 










90 








Gly 


Phe 


Tyr 


Leu 


Asn 


Gly 


Ala 


Asp 


Ser 


Ala 


Lys 


Cys 


Thr 


Glu 


Glu 


Gly 




95 










100 










105 










Lys 


Trp 


Ser 


Pro 


Glu. 


Leu 


Pro 


Val 


Cys 


Ala 


Pro 


He 


He 


Cys 


Pro 


Pro 


110 










115 










120 










125 


Pro 


Ser 


lie 


Pro 


Thr 


Phe 


Ala 


Thr 


Leu 


Arg 


Val 


Tyr 


Lys 


Pro 


Ser 


Ala 










130 










135 










140 




Gly 


Asn 


Asn 


Ser 


Leu 


Tyr 


Arg 


Asp 


Thr 


Ala 


Val 


Phe 


Glu 


Cys 


Leu 


Pro 








145 










150 










155 






Gin 


His 


Ala 


Met 


Phe 


Gly 


Asn 


Asp 


Thr 


He 


Thr 


Cys 


Thr 


Thr 


His 


Gly 






160 










165 










170 








Asn 


Trp 


Thr 


Lys 


Leu 


Pro 


Glu 


Cys 


Arg 


Glu 


Val 


Lys 


Cys 


Pro 


Phe 


Pro 




175 










180 










185 










Ser 


Arg 


Pro 


Asp 


Asn 


Gly 


Phe 


Val 


Asn 


Tyr 


Pro 


Ala 


Lys 


Pro 


Thr 


Leu 


190 










195 










200 










205 


Tyr 


Tyr 


Lys 


Asp 


Lys 


Ala 


Thr 


Phe 


Gly 


Cys 


His 


Asp 


Gly 


Tyr 


Ser 


Leu 
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210 


21 5 


22 0 




Asp 




Pro 


pi ii 


ulU XJLC ulU D 


xiix xjy o xjcu oiy nsii ±JL r 


ocx 


Al a 
nl A 








225 




230 235 






Mot- 
lie L. 


Jr X (J 


Ser 


Cys 


ijy s rixa • ocJL Lys 


t,vb Val Pro Val Ta/s TiVcj 
xty o vax riu vax uy d j 


Ala 

nX A 


X I1X 






240 




245 


250 






Val 


Val 

V AX 


±yr 


VJ JLJLi. 


Rlv Rlu Atcj Val 
uiy ujlu niy v ai 


T»vq Tip Gin fllu TjVS Phf* 

XJy o lie vj J- ix uy o rue 


TjVS 


As ii 




255 






260 


265 










Leu 


nib 


rtl v A cjt> Ta/o \Ts»1 
oiy .nok/ xi y d vdi 


ocx Jl ii" riic \— _y e> J-iy o nou 


xjy fc> 


Clin 

OX VI 


270 








275 


280 




285 


Lys 


Lys 


Cys 


Ser 


Tyr Thr Glu Asp 


Ala Gin Cys lie Asp Gly 


Thr 


lie 










290 


295 


300 




Glu 


Val 


Pro 


Lys 


Cys Phe Lys Glu 


His Ser Ser Leu Ala Phe 


Trp 


Lys 








305 




310 315 






Thr 


Asp 


Ala 


Ser 


Asp Val Lys Pro 


Cys 










320 




325 









<210> 107 

<211> 1520 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> 5'UTR 
<222> 1..15 

<220> 
<221> CDS 
<222> 16. .1449 

<220> 

<221> 3'UTR 
<222> 1450. .1520 

<220> 

<221> polyA — signal 
<222> 1483. .1488 

<220> 

<221> polyA_site 
<222> 1505. .1520 

<400> 107 

cttttttttg acaag atg gcg gca gga ggc agt ggc gtt ggt ggg aag cgc 51 
Met Ala Ala Gly Gly Ser Gly Val Gly Gly Lys Arg 
1 5 10 



age 


teg 


aaa 


age 


gat 


gee 


gat 


tct 


ggt 


ttc 


ctg 


ggg 


ctg 


egg 


ccc 


act 


99 


Ser 


Ser 


Lys 


Ser 


Asp 


Ala 


Asp 


Ser 


Gly 


Phe 


Leu 


Gly 


Leu 


Arg 


Pro 


Thr 








15 










20 










25 










teg 


gtg 


gac 


cca 


gcg 


ctg 


agg 


egg 


egg 


egg 


cga 


ggc 


cca 


aga 


aat 


aag 


147 


Ser 


val 


Asp 


Pro 


Ala 


Leu 


Arg 


Arg 


Arg 


Arg 


Arg 


Gly 


Pro 


Arg 


Asn 


Lys 






30 










35 










40 












aag 


egg 


ggc 


tgg 


egg 


egg 


ctt 


get 


cag 


gag 


ccg 


ctg 


ggg 


ctg 


gag 


gtt 


195 


Lys 


Arg 


Gly 


Trp 


Arg 


Arg 


Leu 


Ala 


Gin 


Glu 


Pro 


Leu 


Gly 


Leu 


Glu 


Val 




45 










50 










55 










60 




gac 


cag 


ttc 


ctg 


gaa 


gac 


gtg 


egg 


eta 


cag 


gag 


cgc 


acg 


age 


ggt 


ggc 


243 


Asp 


Gin 


Phe 


Leu 


Glu 


Asp 


Val 


Arg 


Leu 


Gin 


Glu 


Arg 


Thr 


Ser 


Gly 


Gly 












65 










70 










75 






ttg 


ttg 


tea 


gag 


gee 


cca 


aat 


gaa 


aaa 


etc 


ttc 


ttc 


gtg 


gac 


act 


ggc 


291 


Leu 


Leu 


Ser 


Glu 


Ala 


Pro 


Asn 


Glu 


Lys 


Leu 


Phe 


Phe 


Val 


Asp 


Thr 


Gly 










80 










85 










90 








tec 


aag 


gaa 


aaa 


ggg 


ctg 


aca 


aag 


aag 


aga 


ace 


aaa 


gtc 


cag 


aag 


aag 


339 


Ser 


Lys 


Glu 


Lys 


Gly 


Leu 


Thr 


Lys 


Lys 


Arg 


Thr 


Lys 


Val 


Gin 


Lys 


Lys 
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tea ctg 


ctt 


etc 


aag 


aaa 


ccc 


ctt 


egg 


gtt 


gac 


etc 


ate etc 


aaq 

3 "3 


aac 


387 


Ser Leu 


Leu 


Leu 


Lys 


Lys 


Pro 


Leu 


Arg 


val 


Asp 

Xr 


Leu 


He Leu 


Glu 


Asn 




110 










115 










120 










aca tec 


aaa 


gtc 


cct 


gee 


ccc 


aaa 


gac 


gtc 


etc 


gee 


cac cag 


gtc 


ccc 


435 


Thr Ser 


Lys 


Val 


Pro 


Ala 


Pro 


Lys 


Asp 


Val 


Leu 


Ala 


His Gin 


Val 


Pro 




125 








130 










135 








140 




aac gec 


aag 


aag 


etc 


agg 


egg 


aag 


gag 


cag 


eta 


tgg 


gag aag 


ctq 

w 3 


gec 


483 


Asn Ala 


Lys 


Lys 


Leu 


Arg 


Arg 


Lys 


Glu 


Gin 


Leu 


Trp 


Glu Lys 


Leu 


Ala 










145 










150 








155 






aag cag 


ggc 


gag 


ctg 


ccc 


egg 


gag 


gtg 


cgc 


agg 


gee 


cag gee 


egg 


etc 


531 


Lys Gin 


Gly 


Glu 


Leu 


Pro 


Arg 


Glu 


Val 


Arq 

* ZJ 


Arq 


Ala 


Gin Ala 


Arq 


Leu 








160 










165 








170 








etc aac 


cct 


tct 


gca 


aca 


agg 


gee 


aaq 

*~3 


ccc 


qqq 

333 


ccc 


cag gac 


acc 


qta 


579 


Leu Asn 


Pro 


Ser 


Ala 


Thr 


Arq 

ZJ 


Ala 


Lvs 


Pro 


Glv 


Pro 


Gin Asp 


Thr 


Val 






175 










180 










185 








gag egg 


ccc 


ttc 


tac 


qac 

3^ 


etc 


tqq 


qcc 


tea 


qac 


aac 


ccc ctg 


gac 


aaa 

a 3 3 


627 


Glu Arg 


Pro 


Phe 


Tyr 


Asp 


Leu 


Trp 

xr 


Ala 


Ser 


Asp 


Asn 


Pro Leu 


Asp 


Ara 




190 










195 










200 










CCq ttq 


qtt 


qqc 


caq 


qat 
3 ^ 


qaq 


ttt 


ttc 


ctq 


qaq 

3 a 3 


cag 


acc aaa 


aag 


aaa 


675 


Pro Leu 


Val 


Gly 


Gin 


Asp 


Glu 


Phe 


Phe 


Leu 


Glu 


Gin 


Thr Lys 


Lys 


Lys 




205 








210 










215 








220 




qqa qtq 


aaa 


cqq 


cca 


qca 


cac 


ctg 


cac 


acc 


aag 


ccg 


tec caa 


gca 


ccc 


/ £,Zi 


Gly Val 


Lvs 


Arg 


Pro 


Ala 


Arq 


Leu 


His 


Thr 


Lvs 


Pro 


Ser Gin 


Ala 


Pro 










225 










230 








235 






qcc qtq 

3 3 *"3 


qaq 


qtq 

3 3 


qcq 

3^3 


cct 


qcc 

~j w w 


qqa 

33° 


get 


tec 


tac 


aat 


cca tec 


ttt 


gaa 


771 


Ala Val 


Glu 


Val 


Ala 


Pro 


Ala 


Glv 


Ala 


Ser 


Tvr 


Asn 


Pro Ser 


Phe 


Glu 








240 










245 








250 








gac cac 


caq 


acc 


ctq 


etc 


tea 


qcq 

3 w 3 


qcc 


cac 


aaa 

3 a 3 


ata 

3 *-3 


gag ttg 


cag 


caa 


819 


Asp His 


Gin 


Thr 


Leu 


Leu 


Ser 


Ala 


Ala 


His 


Glu 


Val 


Glu Leu 


Gin 


Ara 






255 










260 










265 








cag aag 


gag 


gcg 


gag 


aag 


ctg 


gag 


cqq 

33 


caq 


Ctq 


qcc 

3 


ctg ccc 


acc 


aca 


867 


Gin Lys 


Glu 


Ala 


Glu 


Lvs 


Leu 


Glu 


Arq 


Gin 


Leu 


Ala 


Leu Pro 


Ala 


Thr 




270 










275 










280 










gag cag 


qcc 

ZJ 


gee 


acc 


caq 

-3 


qaq 


tec 


aca 


ttc 


caq 

vcl 3) 


qaq 


ctg tgc 


aaa 


aaa 


915 


Glu Gin 


Ala 


Ala 


Thr 


Gin 


Glu 


Ser 


Thr 


Phe 


Gin 


Glu 


Leu Cys 


Glu 


Glv 




285 








290 










295 






300 




ctg ctg 


gag 


gag 


teg 


gat 


ggt 


qaq 

3 -3 


qqq 

333 


qaq 


cca 


qqc 

33 v * 


caa qqc 

v -* c *3 33 s * 


aaa 


aaa 

333 


963 


Leu Leu 


Glu 


Glu 


Ser 


Asp 

r 


Glv 


Glu 


Glv 


Glu 


Pro 


Glv 


Gin Gly 


Glu 


Glv 










305 










310 








315 






ccq qaq 

"3 ZJ ZJ 


qct 


qqq 


qat 

13 


qcc 

-3 


qaq 

-3 -3 


qtc 

-3 ^ 


tqt 

v*3 »- 


ccc 


acq 


CCC 


gee cgc 


ctg 


gee 


1011 


Pro Glu 


Ala 


Gly 


Asp 


Ala 


Glu 


Val 


Cvs 


Pro 


Thr 


Pro 


Ala Arg 


Leu 


Ala 








320 










325 








330 








acc aca 


gag 


aaq 


aag 


acg 


qaq 

-3 —3 


cag 


caq 

-3 


cqq 

s *33 


cqq 

'-33 


cqq 

^33 


aaa aaa 
3 a y oB 3 


get 


ata 

3 *-3 


1059 


Thr Thr 


Glu 


Lys 


Lys 


Thr 


Glu 


Gin 


Gin 


Arq 


Arq 


Arq 


Glu Lys 


Ala 


Val 






335 










340 










345 








cac agg 


ctg 


egg 


gta 


cag 


cag 


gee 


gcg 


ttg 


egg 


gee 


qcc cqq 

3 WW v 33 


etc 


cqq 

v 33 


1107 


His Arg 


Leu 


Arg 


Val 


Gin 


Gin 


Ala 


Ala 


Leu 


Arg 


Ala 


Ala Arg 


Leu 


Arq 




350 










355 










360 








cac cag 


9ag 


ctg 


ttc 


egg 


ctg 


cgc 


ggg 


ate 


aag 


gee 


cag gtg 


gee 


ctg 


1155 


His Gin 


Glu 


Leu 


Phe 


Arg 


Leu 


Arg 


Gly 


He 


Lys 


Ala 


Gin Val 


Ala 


Leu 




365 








370 










375 








380 




agg ctg 


gcg 


gag 


ctg 


gcg 


egg 


egg 


cag 


agg 


egg 


cqq 


caq qcq 

3 3 W 3 


Cqq 

3 3 


egg 


1203 


Arg Leu 


Ala 


Glu 


Leu 


Ala 


Arg 


Arg 


Gin 


Arg 


Arg 


Arg 


Gin Ala 


Arg 


Arg 










385 










390 








395 






gag get 


gag 


get 


gac 


aag 


ccc 


ega 


agg 


Ctq 


qqq 

333 


cqq 

33 


etc aag 


tac 


caq 


1251 


Glu Ala 


Glu 


Ala 


Asp 


Lvs 


Pro 


Ara 


Arq 


Leu 


Glv 


Arg 


lieu uy o 


Tvr 


Gin 








400 










405 








410 








gca cct 


gac 


ate 


gac 


gtg 


cag 


ctg 


age 


teg 


gag 


ctg 


aca gac 


tcg 


etc 


1299 


Ala Pro 


Asp 


lie 


Asp 


Val 


Gin 


Leu 


Ser 


Ser 


Glu 


Leu 


Thr Asp 


Ser 


Leu 






415 










420 










425 








agg acc 


ctg 


aag 


ccc 


gag 


ggc 


aac 


ate 


ctt 


cga 


gac 


egg ttc 


aag 


age 


1347 


Arg Thr 


Leu 


Lys 


Pro 


Glu 


Gly 


Asn 


He 


Leu 


Arg 


Asp 


Arg Phe 


Lys 


Ser 




430 










435 










440 
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ttc cag agg agg aat atg ate gag cct cga gag aga gec aag ttc aaa 1395 

Phe Gin Arg Arg Asn Met lie Glu Pro Arg Glu Arg Ala Lys Phe Lys 

445 450 455 460 

cgc aag tac aag gtg aag ctg gtg gag aag egg gcg ttc cgt gag ate 1443 

Arg Lys Tyr Lys Val Lys Leu Val Glu Lys Arg Ala Phe Arg Glu lie 

465 470 475 

cag ttg tagctgecat cagatgeegg agactcgccc ttcaataaaa aatctcttct 1499 
Gin Leu 

agctcaaaaa aaaaaaaaaa a 1520 

<210> 108 
<211> 478 
<212> PRT 
<213> Homo sapiens 

<400> 108 

Met Ala Ala Gly Gly Ser Gly Val Gly Gly Lys Arg Ser Ser Lys Ser 

15 10 15 

Asp Ala Asp Ser Gly Phe Leu Gly Leu Arg Pro Thr Ser Val Asp Pro 

20 25 30 

Ala Leu Arg Arg Arg Arg Arg Gly Pro Arg Asn Lys Lys Arg Gly Trp 

35 40 45 

Arg Arg Leu Ala Gin Glu Pro Leu Gly Leu Glu Val Asp Gin Phe Leu 

50 55 60 

Glu Asp Val Arg Leu Gin Glu Arg Thr Ser Gly Gly Leu Leu Ser Glu 
65 70 75 80 

Ala Pro Asn Glu Lys Leu Phe Phe Val Asp Thr Gly Ser Lys Glu Lys 

85 90 95 

Gly Leu Thr Lys Lys Arg Thr Lys Val Gin Lys Lys Ser Leu Leu Leu 

100 105 110 

Lys Lys Pro Leu Arg Val Asp Leu lie Leu Glu Asn Thr Ser Iiys Val 

115 120 125 

Pro Ala Pro Lys Asp Val Leu Ala His Gin Val Pro Asn Ala Lys Lys 

130 135 140 

Leu Arg Arg Lys Glu Gin Leu Trp Glu Lys Leu Ala Lys Gin Gly Glu 
145 150 155 160 

Leu Pro Arg Glu Val Arg Arg Ala Gin Ala Arg Leu Leu Asn Pro Ser 

165 170 175 

Ala Thr Arg Ala Lys Pro Gly Pro Gin Asp Thr Val Glu Arg Pro Phe 

180 185 190 

Tyr Asp Leu Trp Ala Ser Asp Asn Pro Leu Asp Arg Pro Leu Val Gly 

195 200 205 

Gin Asp Glu Phe Phe Leu Glu Gin Thr Lys Lys Lys Gly Val Lys Arg 

210 215 220 

Pro Ala Arg Leu His Thr Lys Pro Ser Gin Ala Pro Ala Val Glu Val 
225 230 235 240 

Ala Pro Ala Gly Ala Ser Tyr Asn Pro Ser Phe Glu Asp His Gin Thr 

245 250 255 

Leu Leu Ser Ala Ala His Glu Val Glu Leu Gin Arg Gin Lys Glu Ala 

260 265 270 

Glu Lys Leu Glu Arg Gin Leu Ala Leu Pro Ala Thr Glu Gin Ala Ala 

275 280 285 

Thr Gin Glu Ser Thr Phe Gin Glu Leu Cys Glu Gly Leu Leu Glu Glu 

290 295 300 

Ser Asp Gly Glu Gly Glu Pro Gly Gin Gly Glu Gly Pro Glu Ala Gly 
305 310 315 320 

Asp Ala Glu Val Cys Pro Thr Pro Ala Arg Leu Ala Thr Thr Glu Lys 

325 330 335 

Lys Thr Glu Gin Gin Arg Arg Arg Glu Lys Ala Val His Arg Leu Arg 

340 345 350 

Val Gin Gin Ala Ala Leu Arg Ala Ala Arg Leu Arg His Gin Glu Leu 

355 360 365 

Phe Arg Leu Arg Gly lie Lys Ala Gin Val Ala Leu Arg Leu Ala Glu 

100 
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370 




375 








380 






Leu 


Ala 


Arg Arg Gin Arg 


Arg 


Arg 


Gin 


Ala 


Arg Arg 


Glu Ala 


Glu Ala 


385 




390 










395 




400 


Asp 


Lys 


Pro Arg Arg Leu 


Gly 


Arg 


Leu 


Lys 


Tyr Gin 


Ala Pro 


Asp He 






405 








410 






415 


Asp 


Val 


Gin Leu Ser Ser 


Glu 


Leu 


Thr 


Asp 


Ser Leu 


Arg Thr 


Leu Lys 






420 






425 






430 




Pro 


Glu 


Gly Asn He Leu 


Arg 


Asp 


Arg 


Phe 


Lys Ser 


Phe Gin 


Arg Arg 






435 




440 








445 




Asn 


Met 


He Glu Pro Arg 


Glu 


Arg 


Ala 


Lys 


Phe Lys 


Arg Lys 


Tyr Lys 




450 




455 








460 






Val 


Lys 


Leu Val Glu Lys 


Arg 


Ala 


Phe 


Arg 


Glu He 


Gin Leu 




465 




470 










475 







<210> 109 

<211> 1789 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> 5'UTR 
<222> 1. .94 

<220> 
<221> CDS 
<222> 95.. 1252 



<220> 

<221> 3»UTR 
<222> 1253. .1789 



<220> 

<221> polyA_signal 
<222> 1751. .1756 



<220> 

<221> polyA_site 
<222> 1774. .1789 



<400> 109 

ggtcttgcaa tatttattct gctttcgggt agatgggagg cccggggacc tggctgggtt 60 
tctgccaagc ttctccgata cccaggtttc ataa atg tgt ttg ttg ctt tec tgc 115 

Met Cys Leu Leu Leu Ser Cys 
-10 

cct tgc cac ccc tct gec cac gga cag tec atg tgg att gag aga acc 163 
Pro Cys His Pro Ser Ala His Gly Gin Ser Met Trp He Glu Arg Thr 

-5 15 
tec ttc gtg act gca tac aag ctg ccg ggg ate ctg cgc tgg ttt gag 211 
Ser Phe Val Thr Ala Tyr Lys Leu Pro Gly He Leu Arg Trp Phe Glu 
10 15 20 25 

gtg gtg cac atg teg cag acc aca att agt cct ctg gag aat gec ata 259 
Val Val His Met Ser Gin Thr Thr He Ser Pro Leu Glu Asn Ala He 

30 35 40 

gaa acc atg tec acg gec aat gag aag ate ctg atg atg ata aac cag 307 
Glu Thr Met Ser Thr Ala Asn Glu Lys He Leu Met Met He Asn Gin 

45 50 55 

tac cag agt gat gag acc etc ccc ate aac cca etc tec atg etc ctg 355 
Tyr Gin Ser Asp Glu Thr Leu Pro He Asn Pro Leu Ser Met Leu Leu 

60 65 70 

aac ggg att gtg gac cct get gtc atg gga ggc ttc gee aag tat gag 403 
Asn Gly He Val Asp Pro Ala Val Met Gly Gly Phe Ala Lys Tyr Glu 

75 80 85 

aag gee ttc ttc act gaa gag tat gtc agg gac cac cct gag gac cag 451 
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Lys Ala Phe 


Phe 


Thr 


Glu 


Glu Tyr 


Val Arg 


Asp His Pro 


Glu Asp 


Gin 




90 








95 






100 






105 




gac aag 


ctg 


ace 


cac 


etc 


aag gac 


ctg att 


gca tgg cag 


ate 


ccc 


ttc 


499 


Asp Lys Leu 


Thr 


His 


Leu 


Lys Asp 


Leu He 


Ala Trp Gin 


He 


Pro 


Til* _ 

Phe 










110 






115 






120 






ttg gga get 


ggg 


att 


aag 


ate cat 


gag aaa 


agg gtg tea 


gat 


aac 


ttg 


547 


Leu Gly Ala 


Gly 


lie 


Lys 


He His 


Glu Lys 


Arg Val Ser 


Asp Asn 


Leu 








125 








X3Q 




135 








cga ccc 


ttc 


cat 


gac 


egg 


atg gag 


gaa tgt 


ttc aag aac 


ctg 


aaa 


atg 


c o c 


Arg Pro Phe 


His 


Asp 


Arg 


Met Glu 


Glu Cys 


Phe Lys Asn 


Leu 


Lys 


Met 






14 0 








145 




150 










a ag gtg gag 


aag 


gag 


tac 


ggt gtc 


cga gag 


atg cct gac 


ttt 


gac 


gac 


643 


Lys Val Glu 


Lys 


Glu 


Tyr 


Gly Val 


Arg Glu 


Met Pro Asp 


Phe Asp 


Asp 




155 










160 




165 










agg aga 


gtg 


ggc 


cgt 


ccc 


agg tct 


atg ctg 


cgc tea tac 


aga 


cag 


atg 


691 


Arg Arg Val 


Gly 


Arg 


Pro 


Arg Ser 


Met Leu 


Arg Ser Tyr 


Arg Gin 


Met 




170 








175 






180 






185 




tec ate 


ate 


tct 


ctg 


get 


tec atg 


aat tct 


gac tgc age 


acc 


ccc 


age 


739 


Ser lie 


He 


Ser 


Leu 
190 


Ala 


Ser Met 


Asn Ser 
195 


Asp Cys Ser 


Thr 


Pro 
200 


Ser 




aag cct 


ace 


tea 


gag 


age 


ttt gac 


ctg gaa 


tta gca tea 


ccc 


aag 


acg 


787 


Lys Pro 


Thr 


Ser 


Glu 


Ser 


Phe Asp 


Leu Glu 


Leu Ala Ser 


Pro Lys 


Thr 








205 








210 




215 








ccg aga gtg 


gag 


cag 


gag 


gaa ccg 


ate tec 


ccg ggg age 


acc 


ctg 


cct 


835 


Pro Arg Val 


Glu 


Gin 


Glu 


Glu Pro 


He Ser 


Pro Gly Ser 


Thr 


Leu 


Pro 






220 








225 




230 










gag gtc 


aag 


ctg 


egg 


agg 


tec aag 


aag agg 


aca aag aga 


age 


age 


gta 


883 


Glu Val 


Lys 


Leu 


Arg 


Arg 


Ser Lys 


Lys Arg 


Thr Lys Arg 


Ser 


Ser 


Val 




235 










240 




245 










gtt ttt 


gcg 


gat 


gag 


aaa 


gca get 


gca gag 


teg gac ctg 


aag 


egg 


ctt 


931 


Val Phe 


Ala 


Asp 


Glu 


Lys 


Ala Ala 


Ala Glu 


Ser Asp Leu 


Lys Arg 


Leu 




250 








255 






260 






265 




tec agg 


aag 


cat 


gag 


ttc 


atg agt 


gac acc 


aac etc teg 


gag 


cat 


gcg 


979 


Ser Arg 


Lys 


His 


Glu 
270 


Phe 


Met Ser 


Asp Thr 
275 


Asn Leu Ser 


Glu 


His 
280 


Ala 




gee ate 


ccc 


etc 


aag 


gcg 


tct gtc 


etc tct 


caa atg age 


ttt 


gec 


age 


1027 


Ala He 


Pro 


Leu 
285 


Lys 


Ala 


Ser Val 


Leu Ser 
290 


Gin Met Ser 


Phe 
295 


Ala 


Ser 




cag tec 


atg 


cct 


acc 


ate 


cca gee 


ctg gcg 


etc tea gtg 


gca 


ggc 


ate 


1075 


Gin Ser 


Met 


Pro 


Thr 


He 


Pro Ala 


Leu Ala 


Leu Ser Val 


Ala Gly 


He 






300 








305 




310 










cct ggg 


ttg 


gat 


gag 


gee 


aac aca 


tct ccc 


cgc etc age 


cag 


acc 


ttc 


1123 


Pro Gly Leu 


Asp 


Glu 


Ala 


Asn Thr 


Ser Pro 


Arg Leu Ser 


Gin 


Thr 


Phe 




315 










n o r\ 
320 




325 










etc caa 


etc 


tea 


gat 


ggt 


gac aag 


aag aca 


etc aca egg 


aag 


aag 


gtc 


1171 


Leu Gin 


Leu 


Ser 


Asp 


Gly 


Asp Lys 


Lys Thr 


Leu Thr Arg 


Lys 


Lys 


Val 




330 








335 






340 






345 




aat cag 


ttc 


ttc 


aag 


aca 


atg ctg 


gee age 


aaa teg get 


gaa 


gaa 


ggc 


1219 


Asn Gin 


Phe 


Phe 


Lys 
350 


Thr 


Met Leu 


Ala Ser 
355 


Lys Ser Ala 


Glu 


Glu 
360 


Gly 




aaa cag 


ate 


cca 


gac 


teg 


ctg tec 


acg gac 


ctg tgagctgctg ctgactaggg 


1272 


Lys Gin 


He 


Pro 
365 


Asp 


Ser 


Leu Ser 


Thr Asp 
370 


Leu 











ctgcatggga gagecaggga ggggagtttc tggaagagga aagccatgcg tggaacatcg 1332 
aagectcaga gagtgggaga ctgtccccat cagttgtcct tacttagagg agacagagag 1392 
gecaatcagg tcccagagct tgaatgctaa caagcccagc atcccctggg gctgtgatca 1452 
tggtggatga ggaagectea aegtagatte ctgaactcaa ggtaccagca agaatgeett 1512 
ctcccagtgt gctctcccca acatcctagg cacagctttc ataacccagt ttcttaggtg 1572 
taagaaactg tttttatctc atttattaag tctcagaact taacagaaaa ggaagccttt 1632 
taaatattct ttttaatttt attttagatt aacagttttg tactttacat ttttttatac 1692 
aaccaaccag tttcttttct agecaatcat ctctgaagag ttgctgtttc ttactgacaa 1752 
taaaaaatgt tctcttggtt caaaaaaaaa aaaaaaa 1789 
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<210> 110 
<211> 386 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> SIGNAL 
<222> 1. .15 

<400> 110 

Met Cys Leu Leu Leu Ser Cys Pro Cys His Pro Ser Ala His Gly Gin 
-15 -10 -5 1 

Ser Met Trp lie Glu Arg Thr Ser Phe Val Thr Ala Tyr Lys Leu Pro 

5 10 15 

Gly He Leu Arg Trp Phe Glu Val Val His Met Ser Gin Thr Thr He 

20 25 30 

Ser Pro Leu Glu Asn Ala He Glu Thr Met Ser Thr Ala Asn Glu Lys 

35 40 45 

He Leu Met Met He Asn Gin Tyr Gin Ser Asp Glu Thr Leu Pro He 
50 55 60 65 

Asn Pro Leu Ser Met Leu Leu Asn Gly He Val Asp Pro Ala Val Met 

70 75 80 

Gly Gly Phe Ala Lys Tyr Glu Lys Ala Phe Phe Thr Glu Glu Tyr Val 

85 90 95 

Arg Asp His Pro Glu Asp Gin Asp Lys Leu Thr His Leu Lys Asp Leu 

100 105 110 

He Ala Trp Gin He Pro Phe Leu Gly Ala Gly lie Lys He His Glu 

115 120 125 

Lys Arg Val Ser Asp Asn Leu Arg Pro Phe His Asp Arg Met Glu Glu 
130 135 140 ~ 145 

Cys Phe Lys Asn Leu Lys Met Lys Val Glu Lys Glu Tyr Gly Val Arg 

150 155 160 

Glu Met Pro Asp Phe Asp Asp Arg Arg Val Gly Arg Pro Arg Ser Met 

165 170 175 

Leu Arg Ser Tyr Arg Gin Met Ser He He Ser Leu Ala Ser Met Asn 

180 185 190 

Ser Asp Cys Ser Thr Pro Ser Lys Pro Thr Ser Glu Ser Phe Asp Leu 

195 200 205 

Glu Leu Ala Ser Pro Lys Thr Pro Arg Val Glu Gin Glu Glu Pro He 
210 215 220 225 

Ser Pro Gly Ser Thr Leu Pro Glu Val Lys Leu Arg Arg Ser Lys Lys 

230 235 240 

Arg Thr Lys Arg Ser Ser Val Val Phe Ala Asp Glu Lys Ala Ala Ala 

245 250 255 

Glu Ser Asp Leu Lys Arg Leu Ser Arg Lys His Glu Phe Met Ser Asp 

260 265 270 

Thr Asn Leu Ser Glu His Ala Ala He Pro Leu Lys Ala Ser Val Leu 

275 280 285 

Ser Gin Met Ser Phe Ala Ser Gin Ser Met Pro Thr He Pro Ala Leu 
290 295 300 305 

Ala Leu Ser Val Ala Gly He Pro Gly Leu Asp Glu Ala Asn Thr Ser 

310 315 320 

Pro Arg Leu Ser Gin Thr Phe Leu Gin Leu Ser Asp Gly Asp Lys Lys 

325 330 335 

Thr Leu Thr Arg Lys Lys Val Asn Gin Phe Phe Lys Thr Met Leu Ala 

340 345 350 

Ser Lys Ser Ala Glu Glu Gly Lys Gin He Pro Asp Ser Leu Ser Thr 

355 360 365 

Asp Leu 
370 

<210> 111 
<211> 1408 
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<212> DNA 

<213> Homo sapiens 

<220> 

<221> 5'UTR 
<222> 1. .102 

<220> 
<221> CDS 
<222> 103. .1263 

<220> 

<221> 3'UTR 
<222> 1264. .1408 

<220> 

<221> polyA_signal 
<222> 1341. .1346 

<220> 

<221> polyA_site 
<222> 1365. .1408 

<400> 111 

cttcttgact ctctgttcac agaactcagg ctgcctccag ccagcctttg cccgctagac 60 
tcactggccc tgatcacttg aaggtgcagc aagtcactga ga atg age act ttc 114 

Met Ser Thr Phe 

1 



ttc 


teg 


gac 


aca 


gca 


tgg 


ate 


tgc 


ctg 


get 


gtc 


ccc 


aca 


gta 


eta 


tgt 


162 


Phe 


Ser 


Asp 


Thr 


Ala 


Trp 


He 


Cys 


Leu 


Ala 


Val 


Pro 


Thr 


Val 


Leu 


Cys 




5 










10 










15 










20 




ggg 


aca 


gta 


ttt 


tgc 


aaa 


tac 


aag 


aag 


age 


tea 


ggg 


cag 


ctg 


tgg 


age 


210 


Gly 


Thr 


Val 


Phe 


Cys 


Lys 


Tyr 


Lys 


Lys 


Ser 


Ser 


Gly 


Gin 


Leu 


Trp 


Ser 












25 










30 










35 






tgg 


atg 


gtc 


tgc 


ctg 


gca 


ggc 


etc 


tgt 


gca 


gtc 


tgc 


ctg 


etc 


ate 


ctg 


258 


Trp 


Met 


Val 


Cys 


Leu 


Ala 


Gly 


Leu 


Cys 


Ala 


Val 


Cys 


Leu 


Leu 


He 


Leu 










40 










45 










50 








tec 


cct 


ttt 


tgg 


ggc 


ttg 


ate 


etc 


ttc 


teg gtg 


tea 


tgc 


ttc 


etc 


atg 


306 


Ser 


Pro 


Phe 


Trp 


Gly 


Leu 


He 


Leu 


Phe 


Ser 


Val 


Ser 


Cys 


Phe 


Leu 


Met 








55 










60 










65 










tat 


act 


tac 


tta 


tct 


ggc 


caa 


gaa 


ttg 


tta 


cct 


gtg 


gat 


cag 


aag 


gca 


354 


Tyr 


Thr 


Tyr 


Leu 


Ser 


Gly 


Gin 


Glu 


Leu 


Leu 


Pro 


Val 


Asp 


Gin 


Lys 


Ala 






70 










75 










80 












gtc 


ctg 


gtg 


aca 


ggt 


ggt 


gat 


tgc 


ggg 


ctt 


ggc 


cat 


get 


ttg 


tgc 


aag 


402 


Val 


Leu 


Val 


Thr 


Gly 


Gly 


Asp 


Cys 


Gly 


Leu Gly 


His 


Ala 


Leu 


Cys 


Lys 




85 










90 










95 










100 




tat 


ctg 


gat 


gag 


ctg 


ggc 


ttc 


acg 


gta 


ttt 


gec 


gga 


gtt 


ttg 


aat 


gaa 


450 


Tyr 


Leu 


Asp 


Glu 


Leu 


Gly 


Phe 


Thr 


Val 


Phe 


Ala 


Gly 


Val 


Leu 


Asn 


Glu 












105 










110 










115 






aat 


ggc 


cca 


gga 


get 


gag 


gaa 


ttg 


cga 


aga 


acc 


tgc 


tct 


ccg 


cgc 


etc 


498 


Asn 


Gly 


Pro 


Gly 


Ala 


Glu 


Glu 


Leu 


Arg 


Arg Thr 


Cys 


Ser 


Pro 


Arg 


Leu 










120 










125 










130 








teg 


gtg 


etc 


caa 


atg 


gac 


ate 


acg 


aag 


cca gtg 


cag 


ata 


aaa 


gat 


get 


546 


Ser 


Val 


Leu 


Gin 


Met 


Asp 


He 


Thr 


Lys 


Pro 


Val 


Gin 


He 


Lys 


Asp 


Ala 








135 










140 










145 










tac 


age 


aag 


gtt 


gca 


gca 


atg 


ctg 


cag 


gac 


aga 


gga 


ctg 


tgg 


get 


gtg 


594 


Tyr 


Ser 


Lys 


Val 


Ala 


Ala 


Met 


Leu 


Gin 


Asp Arg 


Gly 


Leu 


Trp 


Ala 


Val 






150 










155 










160 












ate 


aac 


aat 


get 


ggg 


gtg 


ctt 


ggc 


ttt 


cca 


act 


gat 


ggg 


gag 


ctt 


ctt 


642 


He 


Asn 


Asn 


Ala 


Gly 


Val 


Leu 


Gly 


Phe 


Pro 


Thr 


Asp 


Gly 


Glu 


Leu 


Leu 




165 










170 










175 










180 




ctt 


atg 


act 


gac 


tac 


aaa 


caa 


tgc 


atg 


gec 


gtg 


aac 


ttc 


ttt 


gga 


act 


690 


Leu 


Met 


Thr 


Asp 


Tyr 


Lys 


Gin 


Cys 


Met 


Ala 


Val 


Asn 


Phe 


Phe 


Gly 


Thr 
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185 190 195 



gtg 


gag 


gtc 


aca 


aag 


acg 


4-4-4- 
ttt 


4- 4-/-v 

ttg 


_ _ 
cct 


— 4-4- 
Ctt 


— 4-4- 
Ctt 


aga 


aaa 


tec 


aaa 


ggg 


TOD 

/Jo 


v aj. 


OX U. 


Val 
val 


XI1X 


Lys 


Thr* 

x IIX 




Leu 


Pro 


Leu 


Leu 


Arg 


Lys 


Ser 


Lys 


UtJLy 




















•5 n 

Z U 3 










9i f\ 
z xu 








a 99 


ctg 


gtg 


aat 


gtc 


age 


age 


atg 


gga 


gg a 


ggg 


gee 


cca 


gtg 


gaa 


agg 


n ft c 

/DO 


Arg 


Leu 


Val 




Val 
val 


OCX. 


OCX. 


1*1C L. 


y 


uxy 


fll v 

J: 


ai a 


Pro 


Val 


uj. IX 


Arg 




























Z Z O 














tct 


hah 


ggc 


tea 


4- -a 


aag 


gcg 


get 


gtg 


ace 


atg 


4-4-/-. 


tea 


4- na 

tea 


ft "XA 
O J 


Leu 


Ala 


SG2T 






Ser 


OCX 


T,vC 

xjy t> 




Ala 


Val 

V exx 


X Hi 


I'ic L- 


pVift 
JCllC 


Cor 

OCX 


Oft ■>»■ 

OCX 






Z J U 










ZOO 










0 a a 

Zft U 












gtt 


atg 


aga 


ctg 


gag 


ctt 


tec 


aag 


tgg 


gga 


— 4-4- 

att 


aaa 


gtt 


get 


tec 


— 4- _ 

ate 


00Z 


Vet J. 




Arg 


Leu 


r*i n 

VjIU 


Leu 


Ser 


Lys 


Trp 


oiy 


Tl ft 

lie 


Lys 


vai 


Aia 


Oft v 


Tic* 

lie 




Z*±D 










OCA 

/SOU 










ZDb 










O *T A 




CEcL 


cct 


gga 




4- 4- y-» 

etc 


eta 


aca 


aat 


p— km s** 

ate 


gca 


ggc 


acc 


agt 


gac 


aag 


tgg 


QO A 


m n 

V7j.Il 


Pro 


oiy 


oiy 


xriie 


Leu 


mr 


As n 


lie 


Aia 


Giy 


inr 


Ser 


Asp 


Lys 


Trp 












O £ C 

z ob 










z /0 










275 






gaa 


aag 


ctg 


gag 


aag 


gac 


att 


ctg 


gac 


eac 


etc 


ccc 


get 


gag 


gta 


cag 


978 


vjXU 


Lys 


Leu 


r*l ii 


Lys 


Asp 


Tift 

lie 


Leu 


Asp 


tin a 
HIS 


T rtn 

lieu 


Fro 


7\1 -. 

Ala 


bill 


val 


Gin 










nn/\ 

2 oO 










O n r 

285 










290 








gaa 


gac 


tac 


tgc 


cag 


gac 


tac 


ate 


tta 


gca 


cag 


egg 


aat 


ttc 


etc 


eta 


1026 


Olll 


Asp 


Tyr 


Cys 


r»l -ri 

bin 


Asp 


Tyr 


lie 


Leu 


Aia 


GUI 


Arg 


7\ rift 

AS 11 


pne 


Leu 


Leu 








"5QC 




















0 a rz 










f. 4- 

ttg 


ate 


aac 


teg 


tea 


gee 


age 


aag 


gac 


4-4- n 

ttc 


tct 


ccg 


gtg 


ctg 


egg 


gac 


1074 


Leu 


lie 


As n 


Ser 


Leu 


Aia 


Ser 


Lys 


Asp 


pne 


Ser 


Pro 


val 


Leu 


Arg 


Asp 
















lie 
jib 










•"ton 
JzO 












dLC 


cag 


cat 


get 


ate 


ttg 


gcg 


aag 


age 


cct 


4-4-4- 
ttt 


gee 


tat 


tac 


acg 


cca 


1122 


Tl o 

.lie 


fl n 

Gin 


XllS 


ax a 


Tl A 

lie 


Leu 


7Vl <?» 

Aia 


Lys 


Ser 


Pro 


pne 


Ala 


Tyr 


Tyr 


Thr 


Pro 




325 










J J w 










j 3 3 














ggg 


aaa 


ggc 


get 


tac 


ttg 


tgg 


ate 


tgc 


ctt 


get 


cac 


tat 


ttg 


cct 


att 


1170 


Gly 


Lys 


Gly 


Ala 


Tyr 


Leu 


Trp 


lie 


Cys 


Leu 


Ala 


His 


Tyr 


Leu 


Pro 


He 












345 










350 










355 






ggc 


ata 


tat 


gat 


tac 


ttt 


get 


aaa 


aga 


cat 


ttt 


ggc 


caa 


gac 


aag 


ccc 


1218 


Gly 


lie 


Tyr 


Asp 


Tyr 


Phe 


Ala 


Lys 


Arg 


His 


Phe 


Gly 


Gin 


Asp 


Lys 


Pro 










360 










365 










370 








atg 


ccc 


aga 


get 


tta 


aga 


atg 


cct 


aac 


tac 


aag 


aaa 


aag 


gee 


ccc 




1263 


Met 


Pro 


Arg 


Ala 


Leu 


Arg 


Met 


Pro 


Asn 


Tyr 


Lys 


Lys 


Lys 


Ala 


Pro 







375 380 385 



taggcaatgg aagccctcaa agaagtegga atgtcatagt cttgaaatga aagggaaact 1323 
gggaaattgg gtttctcatt aaagttgttt cccactctgt waaaaaaaaa aaaaaaaaaa 1383 
aaaaaaaaga aaaaaaaaaa aaaaa 1408 

. <210> 112 
<211> 387 
<212> PRT 
<213> Homo sapiens 



<400> 112 



Met 


Ser 


Thr 


Phe 


Phe 


Ser 


Asp 


Thr 


Ala 


Trp 


He 


Cys 


Leu 


Ala 


Val 


Pro 


1 








5 










10 










15 




Thr 


Val 


Leu 


Cys 
20 


Gly 


Thr 


Val 


Phe 


Cys 
25 


Lys 


Tyr 


Lys 


Lys 


Ser 
30 


Ser 


Gly 


Gin 


Leu 


Trp 
35 


Ser 


Trp 


Met 


Val 


Cys 
40 


Leu 


Ala 


Gly 


Leu 


Cys 
45 


Ala 


Val 


Cys 


Leu 


Leu 
50 


He 


Leu 


Ser 


Pro 


Phe 
55 


Trp 


Gly 


Leu 


He 


Leu 
60 


Phe 


Ser 


Val 


Ser 


Cys 


Phe 


Leu 


Met 


Tyr 


Thr 


Tyr 


Leu 


Ser 


Gly 


Gin 


Glu 


Leu 


Leu 


Pro 


Val 


65 










70 










75 










80 


Asp 


Gin 


Lys 


Ala 


Val 
85 


Leu 


Val 


Thr 


Gly 


Gly 
90 


Asp 


Cys 


Gly 


Leu 


Gly 
95 


His 


Ala 


Leu 


Cys 


Lys 
100 


Tyr 


Leu 


Asp 


Glu 


Leu 
105 


Gly 


Phe 


Thr 


Val 


Phe 
110 


Ala 


Gly 


Val 


Leu 


Asn 
115 


Glu 


Asn 


Gly 


Pro 


Gly 
120 


Ala 


Glu 


Glu 


Leu 


Arg 
125 


Arg 


Thr 


Cys 
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Ser Pro Ara TiRu Ser Val 


Leu 


Gin 


Met 


130 


135 






He Tivs Asn Ala Tvr Ser 


Lys 


Val 


Ala 


145 150 
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